SHARP AND FUZZY OBSERVABLES ON EFFECT ALGEBRAS 



JENCOVA, A., PULMANNOVA, S., VINCEKOVA, E. 

Abstract. Observables on effect algebras and their fuzzy versions obtained by means
of confidence measures (Markov kernels) are studied. It is shown that, on effect
algebras with the (E)-property, given an observable and a confidence measure, there
exists a fuzzy version of the observable. Ordering of observables according to their
fuzzy properties is introduced, and some minimality conditions with respect to this
ordering are found. Applications of some results of the classical theory of
experiments are considered.



1. Introduction 

In the frame of quantum mechanics, as a proper mathematical formulation of
a physical quantity (so called observable), a normalized positive operator valued
measure is considered, instead of the more traditional spectral measure (projection
valued measure). This approach has also provided a frame to investigate imprecise
measurements of a physical quantity. In the literature (e.g., [21]), the notion of
a quantum mechanical fuzzy observable has been formulated as a smearing of a
sharp observable (projection measure). In the present paper, we study smearing
of observables in a more general frame of effect algebras. In analogy with [21], [20],
we introduce the notion of a confidence measure (which is a Markov kernel), and
we show that in a $\sigma$-orthocomplete effect algebra with an order determining set of
$\sigma$-additive states which has the (E)-property every confidence measure yields
a smeared observable for a given (real) observable. We can then introduce a partial
order for observables by putting $\xi \preceq \eta$ if $\eta$ is a smearing of $\xi$; in this case we say
that $\eta$ is a fuzzy version of $\xi$ [20]. If $\xi \preceq \eta$ and simultaneously $\eta \preceq \xi$, we will write
$\xi \sim \eta$, and say that $\xi$ and $\eta$ are fuzzy equivalent. In analogy with some recent papers
[5], minimal elements in this ordering are called (postprocessing) clean observables,
or optimal measurements [20].

As a motivation, we give the following [21]. Let $L$ be a $\sigma$-orthocomplete effect
algebra, $(\Omega, \mathcal{A})$ a measurable space, $\xi : \mathcal{A} \to L$ a sharp observable on $L$, and $m$
a $\sigma$-additive state on $L$. For every $E \in \mathcal{A}$, $\xi(E)$ is a sharp element of $L$ (recall that
$a \in L$ is sharp if $0$ is the unique common lower bound of $a$ and its orthosupplement
$a'$), and $m(\xi(E)) = \int_\Omega \delta_\omega(E)\, m(\xi(d\omega))$, where $\delta_\omega(E) = 1$ if $\omega \in E$, and
$\delta_\omega(E) = 0$ if $\omega \notin E$. Since realistic measurements always have some imprecision,
one may think that the points of $\Omega$ are to be replaced by probability distributions.
If we replace the Dirac function $\delta_\omega$ by a probability distribution
$\nu_\omega : \mathcal{A} \to [0, 1]$ in every point $\omega$, we obtain $\int_\Omega \nu_\omega(E)\, m(\xi(d\omega))$, which is
a smearing of $m(\xi(E))$. Under some appropriate additional assumptions on $L$ (which
are satisfied in the case of the effect algebra $\mathcal{E}(H)$ of the Hilbert space effects),
there is a smeared observable $\eta$ of $\xi$, such that
$m(\eta(E)) = \int_\Omega \nu_\omega(E)\, m(\xi(d\omega))$ for every $E \in \mathcal{A}$ and every $\sigma$-additive
state $m$.



2. Effect algebras

An effect algebra [16] (see [17] and [23] for alternative definitions) is a set $L$ with
two distinguished elements $0$, $1$ and with a partial binary operation $\oplus : L \times L \to L$
such that for all $a, b, c \in L$ we have

(EAi) if $a \oplus b$ exists in $L$ then $b \oplus a$ exists in $L$ and $a \oplus b = b \oplus a$ (commutativity);
(EAii) if $b \oplus c$ exists in $L$ and $a \oplus (b \oplus c)$ exists in $L$ then $a \oplus b$ exists in $L$ and
$(a \oplus b) \oplus c$ exists in $L$, and $(a \oplus b) \oplus c = a \oplus (b \oplus c)$ (associativity);
(EAiii) for every $a \in L$ there is a unique $b \in L$ such that $a \oplus b = 1$
(orthosupplementation);
(EAiv) if $1 \oplus a$ is defined, then $a = 0$ (zero-one law).

As usual, we shall write $L = (L; \oplus, 0, 1)$ for effect algebras. If the assumptions
of (EAii) are satisfied, we write $a \oplus b \oplus c$ for the element $(a \oplus b) \oplus c = a \oplus (b \oplus c)$
in $L$.

Let $a, b$ be elements of an effect algebra $L$. We say that (i) $a$ is orthogonal to $b$
and write $a \perp b$ iff $a \oplus b$ is defined in $L$; (ii) $a$ is less than or equal to $b$ and write
$a \le b$ iff there exists an element $c$ in $L$ such that $a \perp c$ and $a \oplus c = b$ (in this case
we also write $b \ge a$); (iii) $b$ is the orthosupplement of $a$ and write $b = a'$ iff $b$ is the
(unique) element in $L$ such that $b \perp a$ and $a \oplus b = 1$. If $a \le b$, then the element $c$
such that $a \oplus c = b$ is uniquely defined, and we write $c = b \ominus a$. In particular, for
every $a \in L$, $a' = 1 \ominus a$, $a \perp b$ iff $b \le a'$, and $(a \oplus b)' = a' \ominus b$.

For a finite sequence $a_1, a_2, \ldots, a_n$, $n \ge 3$, we define recursively

(1) $a_1 \oplus \cdots \oplus a_n := (a_1 \oplus \cdots \oplus a_{n-1}) \oplus a_n$,

supposing that $(a_1 \oplus \cdots \oplus a_{n-1})$ and $(a_1 \oplus \cdots \oplus a_{n-1}) \oplus a_n$ exist in $L$. Due to
associativity of $\oplus$, the element (1) is correctly defined. Define $a_1 \oplus \cdots \oplus a_n = a_1$ if
$n = 1$, and $a_1 \oplus \cdots \oplus a_n = 0$ if $n = 0$. Then, due to commutativity and associativity
of $\oplus$, for any permutation $(i_1, i_2, \ldots, i_n)$ of $(1, 2, \ldots, n)$ and any $0 \le k \le n$ we have

(2) $a_1 \oplus \cdots \oplus a_n = a_{i_1} \oplus \cdots \oplus a_{i_n}$,

(3) $a_1 \oplus \cdots \oplus a_n = (a_1 \oplus \cdots \oplus a_k) \oplus (a_{k+1} \oplus \cdots \oplus a_n)$.

We say that a finite sequence $F = \{a_1, \ldots, a_n\}$ is orthogonal if $a_1 \oplus \cdots \oplus a_n$
exists in $L$, and we say that $F$ has the $\oplus$-sum $\bigoplus F$, which is defined by

(4) $\bigoplus F = a_1 \oplus \cdots \oplus a_n$.

An arbitrary system $G = \{a_i\}_{i \in I}$ of (not necessarily different) elements of $L$ is
said to be orthogonal if for any finite subset $J$ of $I$, the system $\{a_i\}_{i \in J}$ is orthogonal.
An orthogonal system $G = \{a_i\}_{i \in I}$ has an $\oplus$-sum in $L$, if in $L$ there exists the join

(5) $\bigoplus G := \bigvee_{J} \bigoplus_{i \in J} a_i$,

where $J$ runs over all finite subsets of $I$. In this case, we also write
$\bigoplus G := \bigoplus_{i \in I} a_i$.

Evidently, if $G = \{a_1, \ldots, a_n\}$ is orthogonal, then the $\oplus$-sums defined by (4) and
(5) coincide.

Let $G = \{a_i\}_{i \in I}$ and $a_i = a$ for all $i \in I$. The greatest $n$ such that $\bigoplus_{i \le n} a_i$
exists is called the isotropic index of $a$. If $\bigoplus_{i \le n} a_i$ exists for all $n \in \mathbb{N}$, we say
that the isotropic index of $a$ is infinite. If $\bigoplus G$ exists and $I$ is infinite, then $a = 0$.
Indeed, let $a_0 = \bigoplus G$, then $a_0 = a_j \oplus \bigoplus_{i \in I \setminus \{j\}} a_i = a \oplus a_0$, which gives $a = 0$.
Notice that if $G$ is only orthogonal, then $a$ is not necessarily $0$.

We say that an effect algebra $L$ is $\sigma$-orthocomplete (orthocomplete) if $\bigoplus_{i \in I} a_i$
exists for any countable (arbitrary) orthogonal system $\{a_i : i \in I\}$ of elements of
$L$. We recall that an effect algebra is $\sigma$-orthocomplete iff for every nondecreasing
sequence $\{a_i\}_{i \in \mathbb{N}}$ there is a supremum $a = \bigvee_{i \in \mathbb{N}} a_i$.

A mapping $s : L \to [0, 1]$ from $L$ to the unit interval $[0, 1]$ of real numbers is a state
on $L$ if (i) $s(1) = 1$, (ii) $s(a \oplus b) = s(a) + s(b)$ whenever $a \oplus b$ exists in $L$. It is clear
that $s(0) = 0$, and $s(a) \le s(b)$ whenever $a \le b$, $a, b \in L$. A state $s : L \to [0, 1]$ is
said to be $\sigma$-additive, or completely additive, if the equality

(6) $s\left(\bigoplus_{i \in I} a_i\right) = \sum_{i \in I} s(a_i)$

holds for any countable, or arbitrary, index set $I$, respectively, such that $\bigoplus_{i \in I} a_i$
exists in $L$.

A non-void system $S$ of states on $L$ is said to be order determining, if for $a, b \in L$,
$a \le b$ iff $s(a) \le s(b)$ for all $s \in S$. We denote by $\mathrm{Conv}(S)$ and $\mathrm{Conv}_\sigma(S)$ the
convex and $\sigma$-convex hull of $S$, respectively. Clearly, elements of $\mathrm{Conv}(S)$ and
$\mathrm{Conv}_\sigma(S)$ are states on $L$, and moreover, $S$ is order determining iff $\mathrm{Conv}(S)$ is
order determining, or, equivalently, iff $\mathrm{Conv}_\sigma(S)$ is order determining.

Let $L$ and $P$ be effect algebras. A mapping $\phi : L \to P$ is a morphism if (i)
$\phi(1_L) = 1_P$, where $1_L$ and $1_P$ are the unit elements in $L$ and $P$, respectively, and
(ii) $a \perp b$ implies $\phi(a) \perp \phi(b)$, and $\phi(a \oplus b) = \phi(a) \oplus \phi(b)$. A morphism is called a
$\sigma$-morphism (complete morphism) if it preserves all existing countable (arbitrary)
$\oplus$-sums. A bijective morphism $\phi$ such that $a \perp b$ iff $\phi(a) \perp \phi(b)$ is an isomorphism.
A $\sigma$-isomorphism, resp. complete isomorphism, is defined in an obvious way.

A subset $P$ of an effect algebra $L$ is a sub-effect algebra, if (i) $0 \in P$, $1 \in P$; (ii)
$a, b \in P$, $a \perp b$ implies $a \oplus b \in P$; (iii) $a \in P$ implies $a' \in P$.

We recall that an effect algebra is:

- an orthoalgebra iff $a \perp a$ implies $a = 0$;

- an orthomodular poset iff $a \perp b$ implies $a \oplus b = a \vee b$;

- an orthomodular lattice iff it is a lattice ordered orthomodular poset;

- an MV-effect algebra iff it is lattice ordered and the equalities
$(a \vee b) \ominus a = b \ominus (a \wedge b)$ are satisfied. We recall that MV-effect algebras coincide with
MV-algebras introduced by Chang [5] as algebraic bases for many-valued logic.

Two of the most important prototypes of effect algebras are the following examples.

Example 2.1. Consider the closed interval $[0, 1]$ of reals ordered in the natural
way. For two numbers $a, b \in [0, 1]$, $a \oplus b$ is defined iff $a + b \le 1$, and then $a \oplus b = a + b$.
Then $[0, 1]$ is an orthocomplete effect algebra, and the effect algebra order coincides
with the natural order of reals. With respect to this order, $[0, 1]$ is a totally ordered,
distributive lattice. We recall that $\{a_t\}$ is orthogonal iff $\sum_t a_t \le 1$, and then
$\bigoplus_t a_t = \sum_t a_t$. There is only one state on $[0, 1]$, namely the identity mapping
$s_0(a) = a$. Clearly, $s_0$ is completely additive and the one-point set $\{s_0\}$ is order
determining. We recall that $[0, 1]$ is also a prototype of MV-algebras.
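
As an illustration only, the partial operations of Example 2.1 can be sketched in a few lines of Python. The function names (`oplus`, `orthosupplement`, `leq`, `oplus_all`) are ours and do not appear in the source; exact rational arithmetic is assumed so that the condition $a + b \le 1$ is tested without rounding.

```python
from fractions import Fraction
from typing import Iterable, Optional

ZERO, ONE = Fraction(0), Fraction(1)

def oplus(a: Fraction, b: Fraction) -> Optional[Fraction]:
    """Partial sum on [0, 1]: defined (not None) only when a + b <= 1."""
    return a + b if a + b <= ONE else None

def orthosupplement(a: Fraction) -> Fraction:
    """The unique b with a (+) b = 1, i.e. a' = 1 - a."""
    return ONE - a

def leq(a: Fraction, b: Fraction) -> bool:
    """Effect-algebra order: a <= b iff a (+) c = b for some c in [0, 1]."""
    c = b - a
    return ZERO <= c <= ONE and oplus(a, c) == b

def oplus_all(elements: Iterable[Fraction]) -> Optional[Fraction]:
    """(+)-sum of a finite family; None if the family is not orthogonal."""
    total = ZERO
    for a in elements:
        total = oplus(total, a)
        if total is None:
            return None
    return total

a, b = Fraction(1, 3), Fraction(1, 2)
print(oplus(a, b))                # 5/6
print(oplus(b, Fraction(2, 3)))   # None, because 1/2 + 2/3 > 1
# The identity (a (+) b)' = a' (-) b from Section 2:
print(orthosupplement(oplus(a, b)) == orthosupplement(a) - b)  # True
```

Returning `None` for an undefined sum is one possible way to model partiality; an exception would serve equally well.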

Example 2.2. Let $\mathcal{E}(H)$ be the set of all self-adjoint operators $A$ on a Hilbert space $H$
such that $0 \le A \le I$, where $0$ is the zero and $I$ the identity mapping, ordered by
the usual order of self-adjoint operators, namely $A \le B$ iff $\langle Ax, x \rangle \le \langle Bx, x \rangle$ for all
$x \in H$. We define, on $\mathcal{E}(H)$, $A \perp B$ iff $A + B \le I$, and then put $A \oplus B = A + B$. Then
$(\mathcal{E}(H); \oplus, 0, I)$ becomes an effect algebra, in which the algebraic order coincides with
the usual order that we started with. A system $\{A_t\}_t$ of elements from $\mathcal{E}(H)$ is
orthogonal iff $\sum_t A_t \le I$, where the summation is in the weak, or equivalently in
the strong operator topology, and then $\bigoplus_t A_t = \sum_t A_t$. The system $(\mathcal{E}(H); \oplus, 0, I)$
is an orthocomplete effect algebra which is not a lattice [18].

Denote by $\mathcal{P}(H)$ the set of all orthogonal projections on $H$. Then $\mathcal{P}(H)$ is a
sub-effect algebra of $\mathcal{E}(H)$, which is a complete orthomodular lattice.

We recall that $\mathcal{E}(H)$, as well as $\mathcal{P}(H)$, play an important role in the foundations
of quantum mechanics and the theory of quantum measurements [2].



3. Observables on effect algebras

Let $L$ be a $\sigma$-orthocomplete effect algebra, and $(\Omega, \mathcal{A})$ a measurable space. By
an $(\Omega, \mathcal{A})$-observable on $L$ we mean a mapping $\xi : \mathcal{A} \to L$ such that

(i) $\xi(\Omega) = 1$;

(ii) the system $\{\xi(E_i)\}_{i \in \mathbb{N}}$ is orthogonal and $\xi\left(\bigcup_{i=1}^\infty E_i\right) = \bigoplus_{i=1}^\infty \xi(E_i)$
whenever $E_i \cap E_j = \emptyset$, $i \ne j$, and $E_i \in \mathcal{A}$ for $i \ge 1$.

If $(\Omega, \mathcal{A}) \subset (\mathbb{R}, \mathcal{B}(\mathbb{R}))$, then an observable $\xi : \mathcal{A} \to L$ is said to be a real
observable.

Let $(\Omega_1, \mathcal{A}_1)$ be another measurable space, and let $f : \Omega \to \Omega_1$ be a measurable
function, that is, $f^{-1}(A) \in \mathcal{A}$ whenever $A \in \mathcal{A}_1$. If $\xi : \mathcal{A} \to L$ is an observable,
then $f \circ \xi : A \mapsto \xi(f^{-1}(A))$, $A \in \mathcal{A}_1$, is an $(\Omega_1, \mathcal{A}_1)$-observable on $L$. It is called the
$f$-function of $\xi$. In particular, if $\xi$ is a real observable on $L$ and $f : \mathbb{R} \to \mathbb{R}$ is a
Borel measurable function, then $f \circ \xi$ is also a real observable on $L$.

If $\xi$ is an $(\Omega, \mathcal{A})$-observable on $L$, and $s$ is a $\sigma$-additive state on $L$, then
$s \circ \xi : \mathcal{A} \to [0, 1]$ is a probability measure on $(\Omega, \mathcal{A})$. If $\xi$ is a real observable, we
denote by

(7) $s(\xi) := \int_{\mathbb{R}} \lambda\, s(\xi(d\lambda))$

the mean value of $\xi$ in $s$ whenever the right-hand side of the above equation exists
and is finite.

More generally, if $\xi : (X, \mathcal{A}) \to L$ is an observable, then for any Borel measurable
function $f : X \to \mathbb{R}$, $f(\xi)$ is a real observable, and

$s(f(\xi)) = \int_X f(x)\, s(\xi(dx))$,

using the integral transformation theorem.

The spectrum of a real observable $\xi$ is the smallest closed subset $C$ of $\mathbb{R}$ such
that $\xi(C) = 1$.

For an $(\Omega, \mathcal{A})$-observable $\xi$ on $L$, let $\mathcal{R}(\xi) := \{\xi(A) : A \in \mathcal{A}\}$ denote the range
of $\xi$. Recall that an element $a \in L$ is called sharp if $a \wedge a' = 0$, that is, $0$ is the
only common lower bound of $a$ and $a'$. Clearly, $0$, $1$ are sharp, and $a$ is sharp iff $a'$
is sharp. We will say that an observable $\xi$ is sharp if its range consists of sharp
elements.

Let us consider the following examples. 

Example 3.1. Let $H$ be a Hilbert space, and $\mathcal{E}(H)$ be the effect algebra of Example
2.2. Here the sharp elements coincide with projections. Indeed, if $P$ is a projection,
then $P = P^2$ implies $P \wedge (I - P) = P(I - P) = 0$, and conversely, for any $A \in \mathcal{E}(H)$,
$0 \le A \le I$ implies that $0 \le A^{1/2} A A^{1/2} \le A^{1/2} I A^{1/2}$, which yields $0 \le A^2 \le A$. Then
$0 \le A - A^2 \le A$, and $(I - A) - (A - A^2) = (I - A)^2 \ge 0$ yields $I - A \ge A - A^2$,
hence $A - A^2$ is a common lower bound of $A$ and $I - A$. Hence, $A$ is sharp iff
$A = A^2$, equivalently, iff $A$ is a projection. Sharp observables on $\mathcal{E}(H)$ are then
exactly those whose ranges are in $\mathcal{P}(H)$. These observables are called projection
valued observables (PV-observables, in short), while general observables are called
positive operator valued observables (POV-observables, in short). Owing to the spectral
theorem, real (bounded) PV-observables are in one-to-one correspondence with
(bounded) self-adjoint operators.
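
The argument of Example 3.1 can be checked numerically on small matrices. The following Python sketch is only an illustration for real symmetric $2 \times 2$ matrices with rational entries; the helper names and the two sample matrices are our own choices, and positivity is tested through the diagonal entries and the determinant, which is valid only in this $2 \times 2$ symmetric setting.

```python
from fractions import Fraction as F

def matmul(A, B):
    """Product of two 2x2 matrices stored as tuples of rows."""
    return tuple(tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def sub(A, B):
    """Entrywise difference A - B."""
    return tuple(tuple(A[i][j] - B[i][j] for j in range(2)) for i in range(2))

def is_psd(A):
    """A symmetric 2x2 matrix is >= 0 iff its diagonal entries and determinant are >= 0."""
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return A[0][0] >= 0 and A[1][1] >= 0 and det >= 0

I2 = ((F(1), F(0)), (F(0), F(1)))
ZERO2 = ((F(0), F(0)), (F(0), F(0)))

def is_effect(A):
    """0 <= A <= I."""
    return is_psd(A) and is_psd(sub(I2, A))

def common_lower_bound(A):
    """C = A - A^2, which lies below both A and I - A for every effect A."""
    return sub(A, matmul(A, A))

A = ((F(1, 2), F(1, 4)), (F(1, 4), F(1, 2)))  # an effect that is not a projection
P = ((F(1, 2), F(1, 2)), (F(1, 2), F(1, 2)))  # a projection: P * P == P

C = common_lower_bound(A)
print(C != ZERO2)                               # True: A is not sharp
print(is_psd(sub(A, C)), is_psd(sub(sub(I2, A), C)))  # True True
print(common_lower_bound(P) == ZERO2)           # True: the bound vanishes for P
```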

Example 3.2. Let $X$ be a nonempty set. A tribe over $X$ is a collection of functions
$T \subseteq [0, 1]^X$ such that the zero function $0(x) = 0$ is in $T$ and the following is satisfied:

(T1) $f \in T \implies 1 - f \in T$;

(T2) $f, g \in T \implies f \boxplus g := \min(f + g, 1) \in T$;

(T3) $f_n \in T$, $n \in \mathbb{N}$, and $f_n \nearrow f$ (pointwise) $\implies f \in T$.

Elements of $T$ are called fuzzy sets or fuzzy events. Sharp elements in $T$ coincide
with the characteristic functions contained in $T$. We put $\mathcal{B}(T) := \{B \subseteq X :
\chi_B \in T\}$, where $\chi_B$ is the characteristic function of the set $B$. Then $\mathcal{B}(T)$ is a
$\sigma$-algebra of sets, which is isomorphic with the system of all sharp elements of $T$.
The restriction of any $\sigma$-additive state on $T$ to $\mathcal{B}(T)$ is a probability measure. Due
to the Butnariu and Klement theorem [5], every element in $T$ is a measurable function
with respect to $\mathcal{B}(T)$. Moreover, every $\sigma$-additive state $m$ on $T$ has an integral
representation

(8) $m(f) = \int_X f\, dP$,

where $P$ is the restriction of $m$ to $\mathcal{B}(T)$, i.e., $P(A) = m(\chi_A)$.

Let $\xi$ be an $(\Omega, \mathcal{A})$-observable on $T$. Define $\nu : X \times \mathcal{A} \to [0, 1]$, $\nu(x, A) = \xi(A)(x)$,
where $\xi(A) \in T$. The mapping $\nu$ has the following properties:

(c1) for any fixed $x \in X$, $\nu(x, \cdot)$ is a probability measure on $\mathcal{A}$;
(c2) for any fixed $A \in \mathcal{A}$, $\nu(\cdot, A)$ belongs to $T$.

Conversely, every mapping $\nu : X \times \mathcal{A} \to [0, 1]$ with properties (c1), (c2) gives rise
to an observable on $T$ given by $\xi(A) = \nu(\cdot, A)$.

Clearly, an observable is sharp if its range consists of characteristic functions from
$T$. In fact, if $\pi$ is a sharp $(\Omega, \mathcal{A})$-observable on $T$, then $\pi$ is a $\sigma$-homomorphism
$\pi : \mathcal{A} \to \mathcal{B}(T)$, therefore there is an $(\mathcal{A}, \mathcal{B}(T))$-measurable function $g : X \to \Omega$ such
that $\pi(A) = g^{-1}(A)$, $A \in \mathcal{A}$, and $\nu(x, A) = \chi_{g^{-1}(A)}(x)$ ([31]).

Example 3.3. Recall that an MV-algebra can be defined as a system $(M, \boxplus, {}^*, 0, 1)$
consisting of a nonempty set $M$, two constants $0$ and $1$, a unary operation ${}^*$ and a
binary operation $\boxplus$ satisfying the following axioms:

(MV1) $a \boxplus b = b \boxplus a$;

(MV2) $a \boxplus (b \boxplus c) = (a \boxplus b) \boxplus c$;

(MV3) $a \boxplus a^* = 1$;

(MV4) $a \boxplus 0 = a$;

(MV5) $a^{**} = a$;

(MV6) $0^* = 1$;

(MV7) $a \boxplus 1 = 1$;

(MV8) $(a^* \boxplus b)^* \boxplus b = (a \boxplus b^*)^* \boxplus a$.

The above axioms are equivalent with the original axioms introduced by Chang
in [8] (see [7]). A partial order can be introduced on $M$ by putting $a \le b$ iff
$a^* \boxplus b = 1$. With respect to this ordering, $M$ becomes a distributive lattice, where
$a \vee b = (a^* \boxplus b)^* \boxplus b$, $a \wedge b = (a^* \vee b^*)^*$. By putting $a \oplus b = a \boxplus b$ iff $a \le b^*$, we
obtain an effect algebra $(M; \oplus, 0, 1)$, where $a^*$ is the orthosupplement of $a$ for all
$a \in M$. Conversely, an effect algebra $(L; \oplus, 0, 1)$ can be organized into an
MV-algebra (i.e., it is an MV-effect algebra) iff $L$ is a lattice, and for any $a, b \in L$,
the equality $(a \vee b) \ominus a = b \ominus (a \wedge b)$ holds. The total operation $\boxplus$ is defined by
$a \boxplus b = a \oplus (a' \wedge b)$, and $a^* = a'$. An MV-effect algebra $M$ is $\sigma$-orthocomplete
($\sigma$-MV algebra), or orthocomplete (complete MV algebra) iff $M$ is a $\sigma$-lattice, or a
complete lattice, respectively. Sharp elements in an MV-algebra $M$ coincide with
the idempotents in $M$, that is, $a \wedge a^* = 0$ iff $a \boxplus a = a$. The set $B(M)$ of sharp
elements of $M$ forms a Boolean subalgebra of $M$. If $M$ is $\sigma$-complete, then $B(M)$
is a Boolean $\sigma$-algebra [18].

Every tribe is a $\sigma$-MV algebra with $f \boxplus g = \min(f + g, 1)$, $f^* = 1 - f$, and
where the lattice operations $\vee$, $\wedge$ coincide with pointwise supremum and infimum,
respectively, of $[0, 1]$-valued functions on $X$.
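
For the simplest tribe, the constant functions with values in $[0, 1]$, the MV-algebra identities above reduce to statements about numbers and can be tested exhaustively on a finite grid. The Python sketch below is only such a sanity check; the names `boxplus`, `star`, `join`, `meet` and the grid of multiples of $1/8$ are our own assumptions.

```python
from fractions import Fraction
from itertools import product

ONE = Fraction(1)

def boxplus(a, b):
    """Total (truncated) sum: min(a + b, 1)."""
    return min(a + b, ONE)

def star(a):
    """MV-negation a* = 1 - a."""
    return ONE - a

def join(a, b):
    """a v b = (a* [+] b)* [+] b."""
    return boxplus(star(boxplus(star(a), b)), b)

def meet(a, b):
    """a ^ b = (a* v b*)*."""
    return star(join(star(a), star(b)))

GRID = [Fraction(k, 8) for k in range(9)]

# (MV8) on all pairs of grid points
mv8 = all(boxplus(star(boxplus(star(a), b)), b) == boxplus(star(boxplus(a, star(b))), a)
          for a, b in product(GRID, repeat=2))

# the derived lattice operations are the usual max and min
lattice_ok = all(join(a, b) == max(a, b) and meet(a, b) == min(a, b)
                 for a, b in product(GRID, repeat=2))

# a [+] b = a (+) (a' ^ b): the partial sum on the right is always defined
total_from_partial = all(boxplus(a, b) == a + meet(star(a), b)
                         for a, b in product(GRID, repeat=2))

# sharp elements (a ^ a* = 0) coincide with idempotents (a [+] a = a)
sharp = [a for a in GRID if meet(a, star(a)) == 0]
idempotent = [a for a in GRID if boxplus(a, a) == a]

print(mv8, lattice_ok, total_from_partial)   # True True True
print(sharp == idempotent == [0, 1])         # True
```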

By the Loomis-Sikorski theorem for $\sigma$-MV algebras [27] , [12] , pQ , to every $\sigma$-MV
algebra $M$ there is a triple $(X, T, h)$ consisting of a tribe $T$ of fuzzy sets on a nonvoid
set $X$ and a surjective $\sigma$-homomorphism (of $\sigma$-MV-algebras) $h : T \to M$, such that
the restriction of $h$ to $\mathcal{B}(T)$ maps the latter set onto $B(M)$.

Let $M$ be a $\sigma$-MV-effect algebra, and let $(X, T, h)$ be its representation by the
Loomis-Sikorski theorem. Let $\xi$ be an $(\Omega, \mathcal{A})$-observable on $M$. For every $A \in \mathcal{A}$,
there is an $f_A \in T$ such that $h(f_A) = \xi(A)$, where $f_A$ is unique up to $h$-null
sets. Define $\nu : X \times \mathcal{A} \to [0, 1]$ by putting $\nu(x, A) = f_A(x)$. Clearly, for a fixed
$A \in \mathcal{A}$, $\nu(\cdot, A) \in T$. Moreover, $\xi(A) = h(\nu(\cdot, A))$. Let $(E_i)_{i=1}^\infty$ be a disjoint
sequence of elements of $\mathcal{A}$, and put $E = \bigcup_{i=1}^\infty E_i$. Then $\xi(E) = \bigoplus_{i=1}^\infty \xi(E_i)$.
Choose functions $f, f_i$, $i = 1, 2, \ldots$ in $T$ such that $h(f) = \xi(E)$, $h(f_i) = \xi(E_i)$,
$i = 1, 2, \ldots$. Then we have $h(f) = \bigoplus_i h(f_i) = h(\min(\sum_i f_i, 1))$. Consider an
orthogonal sequence $g_i$, $i = 1, 2, \ldots$, where $g_1 = f_1$, and for $i > 1$,
$g_i = f_i \wedge (g_1 + \cdots + g_{i-1})^*$. We have $h(f_1) = h(g_1)$, and assume that
$h(g_i) = h(f_i)$ for $i < k$. Then
$h(g_k) = h(f_k \wedge (g_1 + \cdots + g_{k-1})^*) = h(f_k) \wedge h(g_1 + \cdots + g_{k-1})'
= h(f_k) \wedge (h(f_1) \oplus \cdots \oplus h(f_{k-1}))' = h(f_k)$.
We proved, by induction, that $h(f_i) = h(g_i)$, $i = 1, 2, \ldots$,
which entails that $h(\{x : f_i(x) \ne g_i(x)\}) = 0$, $i = 1, 2, \ldots$ (we identify sets with
their characteristic functions). Clearly, $\sum_i f_i > 1$ iff $f_i \ne g_i$ for at least one $i$,
therefore $\{x : \sum_i f_i(x) > 1\} = \bigcup_{i=1}^\infty \{x : f_i(x) \ne g_i(x)\} \in \ker(h)$, and this entails
that $h(\{x : f(x) \ne \sum_{i=1}^\infty f_i(x)\}) = 0$. This shows that
$\nu(x, E) = \sum_{i=1}^\infty \nu(x, E_i)$ for all $x$ up to an $h$-null set.

4. Smearing of observables 

4.1. Markov kernels. Let $L$ be a $\sigma$-orthocomplete effect algebra with a system $S$
of $\sigma$-additive states, and let $(X, \mathcal{F})$ and $(Y, \mathcal{G})$ be measurable spaces. Let $\xi$ be an
$(X, \mathcal{F})$-observable on $L$. Consider a mapping $\nu : X \times \mathcal{G} \to [0, 1]$ with the following
properties:

(i) for any fixed $x \in X$, $\nu_x(\cdot) := \nu(x, \cdot) : \mathcal{G} \to [0, 1]$ is a probability measure;

(ii) for any fixed $G \in \mathcal{G}$, the mapping $x \mapsto \nu_G(x) := \nu(x, G)$ is $\mathcal{F}$-measurable.

That is, $\nu$ is a Markov kernel (we note that in analogy with [21], $\nu$ may be called
also a confidence measure). Let $m \in S$. The integral

$\int_X \nu_G(x)\, m(\xi(dx))$

converges by the dominated convergence theorem. If there is an observable
$\eta : (Y, \mathcal{G}) \to L$ such that

(9) $m(\eta(G)) = \int_X \nu_G(x)\, m(\xi(dx))$

for every $m \in S$, then we will call $\eta$ a fuzzy version of $\xi$, or a smearing of $\xi$ in the
states $m \in S$. If moreover the system $S$ is order determining, then the equations
(9) uniquely determine $\eta$, and we call $\eta$ simply a fuzzy version (smearing) of $\xi$. In
this case, we will write $\xi \preceq \eta$. If equation (9) holds for every $m \in S$, we will write
symbolically

(10) $\eta(G) = \int_X \nu_G(x)\, \xi(dx)$.

The relation $\preceq$ is reflexive, since the mapping $(x, G) \mapsto \delta_x(G) = \chi_G(x)$ is a
Markov kernel, and $\xi(G) = \int_X \chi_G(x)\, \xi(dx)$. It is also transitive. Indeed, let $\xi \preceq \eta$
and $\eta \preceq \zeta$, where $\xi : (X, \mathcal{F}) \to L$, $\eta : (Y, \mathcal{G}) \to L$, $\zeta : (Z, \mathcal{H}) \to L$,
$\eta(G) = \int_X \nu_1(x, G)\, \xi(dx)$,
$\zeta(H) = \int_Y \nu_2(y, H)\, \eta(dy) = \int_Y \nu_2(y, H) \int_X \nu_1(x, dy)\, \xi(dx)$. It is
well known that

(11) $\nu_3(x, H) := \int_Y \nu_2(y, H)\, \nu_1(x, dy)$

is a Markov kernel (see [20] for a detailed proof). Therefore, $\preceq$ is a preorder, and
it can be made a partial order in the usual way. If $\xi \preceq \eta$ and $\eta \preceq \xi$, we will write
$\xi \sim \eta$, and we will say that $\xi$ and $\eta$ are fuzzy equivalent. Obviously, $\xi$ is a minimal
element with respect to $\preceq$ if $\eta \preceq \xi$ implies $\eta \sim \xi$. Minimal observables are called
clean in accordance with [5]. We note that in [20], the relation $\preceq$ is defined in the
opposite direction, and maximal elements are called optimal measurements.
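
When the outcome spaces are finite, a Markov kernel is simply a row-stochastic matrix, formula (9) becomes a vector-matrix product, and the composition (11) becomes a matrix product. The following Python sketch illustrates reflexivity and transitivity in that finite setting only; the function names and the sample numbers are ours, and `p[x]` stands for the value $m(\xi(\{x\}))$ in one fixed state $m$.

```python
from fractions import Fraction as F

def is_markov_kernel(nu):
    """Finite analogue of property (i): every row nu[x] is a probability vector."""
    return all(all(v >= 0 for v in row) and sum(row) == 1 for row in nu)

def smear(p, nu):
    """Finite analogue of (9): q[y] = sum over x of nu[x][y] * p[x]."""
    return [sum(nu[x][y] * p[x] for x in range(len(p))) for y in range(len(nu[0]))]

def compose(nu1, nu2):
    """Finite analogue of (11): nu3[x][z] = sum over y of nu2[y][z] * nu1[x][y]."""
    return [[sum(nu1[x][y] * nu2[y][z] for y in range(len(nu2)))
             for z in range(len(nu2[0]))]
            for x in range(len(nu1))]

def dirac_kernel(n):
    """The kernel (x, G) -> delta_x(G), i.e. the identity matrix."""
    return [[F(int(x == y)) for y in range(n)] for x in range(n)]

p = [F(1, 2), F(1, 3), F(1, 6)]                       # distribution of xi in a state m
nu1 = [[F(1), F(0)], [F(1, 2), F(1, 2)], [F(0), F(1)]]  # kernel from X to Y
nu2 = [[F(3, 4), F(1, 4)], [F(1, 4), F(3, 4)]]          # kernel from Y to Z

q = smear(p, nu1)          # distribution of eta
r = smear(q, nu2)          # distribution of zeta
nu3 = compose(nu1, nu2)    # kernel from X to Z

print(is_markov_kernel(nu3))            # True
print(smear(p, nu3) == r)               # True: smearing twice equals smearing by nu3
print(smear(p, dirac_kernel(3)) == p)   # True: reflexivity
```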

4.2. Weak Markov kernels. Let $(\Omega, \mathcal{A})$ be a measurable space. Then $M_1^+(\Omega, \mathcal{A})$
will denote the set of all probability measures on $(\Omega, \mathcal{A})$.

The notion of a Markov kernel can be weakened as follows. Let $(\Omega, \mathcal{A})$ and
$(\Omega_1, \mathcal{A}_1)$ be measurable spaces. Let $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$, and let
$\nu : \Omega \times \mathcal{A}_1 \to \mathbb{R}$. We will say that $\nu$ is a weak Markov kernel with respect to $\mathcal{P}$ if

(i) $\omega \mapsto \nu(\omega, B)$ is $\mathcal{A}$-measurable for all $B \in \mathcal{A}_1$;

(ii) for every $B \in \mathcal{A}_1$, $0 \le \nu(\omega, B) \le 1$, $\mathcal{P}$-a.e.;

(iii) $\nu(\omega, \Omega_1) = 1$, $\mathcal{P}$-a.e., and $\nu(\omega, \emptyset) = 0$, $\mathcal{P}$-a.e.;

(iv) if $\{B_n\}$ is a sequence in $\mathcal{A}_1$ such that $B_n \cap B_m = \emptyset$ for $m \ne n$, then
$\nu(\omega, \bigcup_n B_n) = \sum_n \nu(\omega, B_n)$, $\mathcal{P}$-a.e.

Note that a weak Markov kernel with respect to the whole $M_1^+(\Omega, \mathcal{A})$ is in fact
a Markov kernel.

It is easy to see that if $\nu$ is a weak Markov kernel with respect to $\mathcal{P}$, then

(12) $\nu(P)(B) := \int_\Omega \nu(\omega, B)\, P(d\omega)$, $B \in \mathcal{A}_1$,

is a probability measure on $\mathcal{A}_1$ for all probability measures $P \in \mathcal{P}$.

Let $L$ be a $\sigma$-orthocomplete effect algebra with an order determining system of
$\sigma$-additive states $S$, and let $(X, \mathcal{F})$ and $(Y, \mathcal{G})$ be measurable spaces. Let $\xi$ be an
$(X, \mathcal{F})$-observable on $L$. If $\nu : X \times \mathcal{G} \to \mathbb{R}$ is a weak Markov kernel with respect to
$\mathcal{P} = \{m \circ \xi : m \in S\}$, then

$\nu(m \circ \xi)(B) = \int_X \nu(x, B)\, m \circ \xi(dx)$

is a probability measure on $(Y, \mathcal{G})$, and if there is an observable $\eta$ on $L$ such that

$\nu(m \circ \xi)(B) = m(\eta(B))$

for all $B \in \mathcal{G}$ and all $m \in S$, then we will also call $\eta$ a fuzzy version (or a smearing)
of $\xi$ (in the states $m \in S$, if the latter set is not order determining). If $S$ is order
determining, we also write $\xi \preceq \eta$.
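
In a finite setting, "$\mathcal{P}$-a.e." means "at every point that carries positive mass under some $P \in \mathcal{P}$", so a weak Markov kernel may behave arbitrarily on points that no measure of the family sees. The Python sketch below illustrates this, and formula (12), under that finite-space assumption; the function names and the sample kernel are ours.

```python
from fractions import Fraction as F

def support(family):
    """Points that have positive probability under at least one measure of the family."""
    return {x for P in family for x, px in enumerate(P) if px > 0}

def is_markov_kernel(nu):
    """Every row is a probability vector."""
    return all(all(v >= 0 for v in row) and sum(row) == 1 for row in nu)

def is_weak_markov_kernel(nu, family):
    """Rows need to be probability vectors only on the support of the family."""
    return all(all(v >= 0 for v in nu[x]) and sum(nu[x]) == 1 for x in support(family))

def push(P, nu):
    """Finite analogue of (12): nu(P)(B) = sum over omega of nu[omega][B] * P[omega]."""
    return [sum(nu[x][b] * P[x] for x in range(len(P))) for b in range(len(nu[0]))]

# The third row is not a probability vector, but no measure below charges point 2.
nu = [[F(1, 2), F(1, 2)], [F(0), F(1)], [F(2), F(-1)]]
family = [[F(1, 2), F(1, 2), F(0)], [F(1), F(0), F(0)]]

print(is_markov_kernel(nu))                    # False
print(is_weak_markov_kernel(nu, family))       # True
print([push(P, nu) for P in family])           # each result is a probability vector

# With respect to all point masses, "weak" and ordinary Markov kernels coincide.
all_diracs = [[F(int(i == j)) for j in range(3)] for i in range(3)]
print(is_weak_markov_kernel(nu, all_diracs))   # False
```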

Remark 4.1. We note that a weak Markov kernel $\nu : X \times \mathcal{G} \to [0, 1]$ (with respect
to one probability measure $P$) is called a random measure in the literature. If $\mathcal{G}$ is
the Borel $\sigma$-algebra of subsets of a complete separable metric space $Y$, then there
exists a regular version $\nu^*$ of $\nu$ such that $\nu^*$ is a Markov kernel, and

(13) $\forall G \in \mathcal{G}$, $\nu(x, G) = \nu^*(x, G)$, a.e. $P$

(see, e.g. [5^1 VI. 1. 21.]). For a more general version, see Theorem 6.3.

Let $L$ be a $\sigma$-orthocomplete effect algebra with an order determining set of
$\sigma$-additive states $S$, $\xi$ be an $(X, \mathcal{F})$-observable on $L$, and $(Y, \mathcal{G})$ be a complete metric
space with the Borel $\sigma$-algebra. Let $\nu : X \times \mathcal{G} \to [0, 1]$ be a weak Markov kernel
with respect to $\mathcal{P} = \{m \circ \xi : m \in S\}$. Then for every $m \in S$ there exists a Markov
kernel $\nu^*_m$.

If, in addition, there is a faithful state $m_0$ on $L$, then the regular version $\nu^*_{m_0}$ is
the regular version of $\nu$ for all $m \in S$. This will follow from Theorem 6.3.

We recall that a state $m_0$ on an effect algebra $L$ is faithful if $m_0(a) = 0 \implies a = 0$.
Clearly, for every state $m$ on $L$, $m_0(a) = 0 \implies m(a) = 0$ ($a \in L$), whence for every
observable $\xi$ it holds $m \circ \xi \ll m_0 \circ \xi$. For example, if $H$ is a complex, separable
Hilbert space, then there exists a faithful state $m_0$ on $\mathcal{E}(H)$.

Example 4.2. 1. We can see that in Examples 3.2 and 3.3 Markov kernels are
closely related to observables. Namely, in Example 3.2 observables coincide with
certain Markov kernels.

In Example 3.3 we have the following situation. Let $(X, T, h)$ be the Loomis-Sikorski
representation of a $\sigma$-MV-algebra $M$. Let $\xi : (\Omega, \mathcal{A}) \to M$ be an observable.
For every $A \in \mathcal{A}$, choose $f_A \in T$ such that $\xi(A) = h(f_A)$ (for definiteness, we may
choose the (unique) continuous function in the corresponding class), and define
$\nu(x, A) = f_A(x)$; then $\nu : X \times \mathcal{A} \to [0, 1]$ is a weak Markov kernel with respect to
the family $\mathcal{P} = \{m \circ h\}$, $m$ a $\sigma$-additive state on $M$, of probability measures on
$\mathcal{B}(T)$. Owing to the Butnariu-Klement theorem we have

$m(\xi(A)) = m(h(f_A)) = \int_X f_A(x)\, P(dx)$,

where $P = m \circ h / \mathcal{B}(T) = m \circ (h / \mathcal{B}(T))$. The restriction $h / \mathcal{B}(T) : \mathcal{B}(T) \to B(M)$
can be considered as a sharp observable on $M$, and any other observable may be
considered as a smearing of it in all $\sigma$-additive states on $M$.

2. Let $\eta$, $\xi$ be real observables on an effect algebra $L$ such that $\eta = f \circ \xi$ for some
Borel function $f : \mathbb{R} \to \mathbb{R}$. Then for every $\sigma$-additive state $m$ on $L$,
$m(\eta(E)) = m(\xi(f^{-1}(E))) = \int \chi_{f^{-1}(E)}(\lambda)\, m(\xi(d\lambda))$, $E \in \mathcal{B}(\mathbb{R})$. Put
$\nu(\lambda, E) = \chi_{f^{-1}(E)}(\lambda)$. It is easy to see that $\nu$ is a Markov kernel, and $\eta$ is a
smearing of $\xi$. However, if $\xi$ is sharp, then $\eta$ is sharp, too. Hence a smearing of a
sharp observable may be sharp as well.
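
Part 2 of Example 4.2 says that taking a function of an observable is the special case of smearing in which the kernel only takes the values 0 and 1. A minimal Python sketch for an observable with finite spectrum follows; the names `indicator_kernel`, `smeared_probability`, `pushforward_probability` and the sample distribution are our own, and `dist` plays the role of $m \circ \xi$ for one fixed state $m$.

```python
from fractions import Fraction as F

def indicator_kernel(f):
    """nu(lam, E) = 1 if f(lam) lies in E, and 0 otherwise."""
    return lambda lam, E: 1 if f(lam) in E else 0

def smeared_probability(dist, nu, E):
    """Finite analogue of (9): sum over lam of nu(lam, E) * m(xi({lam}))."""
    return sum(nu(lam, E) * p for lam, p in dist.items())

def pushforward_probability(dist, f, E):
    """m(xi(f^{-1}(E))) computed directly from the definition of f o xi."""
    return sum((p for lam, p in dist.items() if f(lam) in E), F(0))

dist = {-1: F(1, 4), 0: F(1, 2), 1: F(1, 4)}   # distribution of xi on its spectrum

def square(lam):
    return lam * lam

nu = indicator_kernel(square)

for E in ({0}, {1}, {0, 1}, {2}):
    # the two columns agree for every test set E
    print(sorted(E), smeared_probability(dist, nu, E), pushforward_probability(dist, square, E))
```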

4.3. POV-measure with commuting range. In this section, we keep our standing
assumption that $L$ is a $\sigma$-orthocomplete effect algebra, and $S$ is an order
determining set of $\sigma$-additive states on $L$.

Let $\xi : (X, \mathcal{A}) \to L$ and $\eta : (Y, \mathcal{B}) \to L$ be observables on $L$. Assume that $\eta$ is a
smearing of $\xi$, hence there is a (weak) Markov kernel $\nu : X \times \mathcal{B} \to [0, 1]$ such
that for every $m \in S$ and $E \in \mathcal{B}$,

$m(\eta(E)) = \int_X \nu_E(x)\, m(\xi(dx))$.

For $E$ fixed, $\nu_E : X \to [0, 1]$ is a measurable function, and the right hand side of
the above equality is a mean value of the observable $\nu_E(\xi)$ in the state $m$, so that
for every $\sigma$-additive state $m$,

$m(\eta(E)) = m(\nu_E(\xi))$.

Define the observable $\Lambda_{\eta(E)}$ by $\Lambda_{\eta(E)}\{1\} = \eta(E)$, $\Lambda_{\eta(E)}\{0\} = \eta(E)'$; then we
obtain, for all $m \in S$,

$m(\Lambda_{\eta(E)}) = m(\nu_E(\xi))$.

Theorem 4.3. Let $\xi : (X, \mathcal{A}) \to L$ and $\eta : (Y, \mathcal{B}) \to L$ be observables on $L$, such
that $\eta$ is a smearing of $\xi$ with a (weak) Markov kernel $\nu$ such that for all Borel sets
$E$, $\nu(x, E) \in \{0, 1\}$ a.e. $m \circ \xi$, $m \in S$. Then $\mathcal{R}(\eta) \subseteq \mathcal{R}(\xi)$.

If $\eta$ and $\xi$ are sharp, and $\eta$ is real, then also the converse statement is true.

Proof. Let $E \in \mathcal{B}$. Under the hypotheses, we have for every $m \in S$,

$m(\eta(E)) = \int \nu(x, E)\, m(\xi(dx))
= \int_{\{x : \nu(x, E) = 1\}} \nu(x, E)\, m(\xi(dx))
= m(\xi(\{x : \nu(x, E) = 1\}))$.

Since $S$ is order determining, we obtain $\eta(E) = \xi(\{x : \nu(x, E) = 1\}) \in \mathcal{R}(\xi)$.

We note that in this case we have $\Lambda_{\eta(E)} = \nu_E(\xi)$.

Let both $\eta$ and $\xi$ be sharp, and assume that $\mathcal{R}(\eta) \subseteq \mathcal{R}(\xi)$. Since the range of a
sharp observable is a Boolean $\sigma$-algebra [14], if $\eta$ is real, we can apply [31, Theorem
1.4] to obtain that there is a measurable function $f : X \to \mathbb{R}$ such that

$\eta(B) = \xi(f^{-1}(B))$

for all Borel sets $B \subseteq \mathbb{R}$, and the function $f$ is unique up to a $\xi$-null set. Putting
$\nu(x, E) = \chi_{f^{-1}(E)}(x)$, we obtain

$m(\eta(E)) = m(\xi(f^{-1}(E))) = \int_X \nu(x, E)\, m(\xi(dx))$

for all states $m \in S$. Hence $\eta$ is a smearing of $\xi$ with a Markov kernel
$\nu(x, E) = \chi_{f^{-1}(E)}(x) \in \{0, 1\}$. □

Theorem 4.4. On the effect algebra $\mathcal{E}(H)$ of a separable Hilbert space $H$, an observable
(POV-measure) $\eta$ is a smearing of a sharp observable (PV-measure) if and only if the
range $\mathcal{R}(\eta)$ of $\eta$ consists of mutually commuting effects.

Moreover, a sharp real observable $\eta$ is a smearing of a sharp observable $\xi$ if and
only if $\mathcal{R}(\eta) \subseteq \mathcal{R}(\xi)$, equivalently, if and only if $\eta$ is a function of $\xi$.

Proof. Let $\eta : (Y, \mathcal{B}) \to \mathcal{E}(H)$ be a POV-measure that is a smearing of a
PV-measure $\xi : (X, \mathcal{A}) \to \mathcal{E}(H)$. Then for every set $E \in \mathcal{B}$ and every state $m$,
$m(\eta(E)) = \int_X \nu(x, E)\, m(\xi(dx)) = m(\nu_E(\xi))$, which implies that $\eta(E) = \nu_E(\xi)$, where
$\nu_E(\xi)$ is a function of $\xi$. It follows that all the spectral projections of the self-adjoint
operator $\eta(E)$ belong to $\mathcal{R}(\xi)$, and this implies that $\mathcal{R}(\eta)$ consists of mutually
commuting effects.

Conversely, let the range of a POV measure $\eta$ consist of commuting effects. By the
well known von Neumann theorem (see also [31]), there exists a self-adjoint operator
$V$ and Borel measurable functions $f_E$ such that $\eta(E) = f_E(V)$. Then we have, for
every state $m$,

$m(\eta(E)) = \int_{\mathbb{R}} f_E(x)\, m(P^V(dx))$,

where $P^V$ is the spectral measure of $V$. Define $\nu(x, E) := f_E(x)$. We will show
that $\nu$ is a weak Markov kernel.

(i) Since $0 \le \eta(E) \le 1$, $0 \le f_E(x) \le 1$ on the spectrum of $V$, hence $0 \le f_E \le 1$
a.e. $m \circ P^V$ for all $m$.

(ii) $f_Y(V) = \eta(Y) = 1$ implies that $\int \nu(x, Y)\, m(P^V(dx)) = 1$, and as
$0 \le \nu(x, Y) \le 1$, we get $\nu(x, Y) = 1$ a.e. $m \circ P^V$ for all $m$. Similarly we show that
$\nu(x, \emptyset) = 0$ a.e. $m \circ P^V$ for all $m$.

(iii) Let $E = \bigcup_{i=1}^\infty E_i$, where $E_i \cap E_j = \emptyset$ whenever $i \ne j$. From

(14) $\eta(E) = \sum_{i=1}^\infty \eta(E_i)$

(the convergence in weak sense), we obtain that

(15) $\int \nu(x, E)\, P^V(dx) = \sum_{i=1}^\infty \int \nu(x, E_i)\, P^V(dx)$.

Moreover, for every $n$, $\eta\left(\bigcup_{i=1}^n E_i\right) = \sum_{i=1}^n \eta(E_i)$ entails
$f_{\bigcup_{i=1}^n E_i}(V) = \sum_{i=1}^n f_{E_i}(V) = \left(\sum_{i=1}^n f_{E_i}\right)(V)$, which entails that
$f_{\bigcup_{i=1}^n E_i}(x) = \sum_{i=1}^n f_{E_i}(x)$ on the spectrum of $V$.
From this we derive that $0 \le \nu\left(x, \bigcup_{i=1}^n E_i\right) = \sum_{i=1}^n \nu(x, E_i) \le 1$ for all $n$.
Therefore $\sum_{i=1}^\infty \nu(x, E_i)$ exists, and (14) yields that

$f_E(V) = \sum_{i=1}^\infty f_{E_i}(V)$,

whence $f_E(x) = \sum_{i=1}^\infty f_{E_i}(x)$ on the spectrum of $V$. We conclude that
$\nu(x, E) = \sum_{i=1}^\infty \nu(x, E_i)$ a.e. $m \circ P^V$ for every state $m$. This concludes the proof
that $\nu$ is a weak Markov kernel, and $\eta$ is a smearing of $\xi := P^V$.

Let both $\xi$ and $\eta$ be sharp, and let $\eta$ be real. If $\mathcal{R}(\eta) \subseteq \mathcal{R}(\xi)$, Theorem 4.3
implies that $\eta$ is a smearing of $\xi$.

Conversely, assume that $\eta$ is a smearing of $\xi$ with a (weak) Markov kernel $\nu$. For
every state $m$ and every $E \in \mathcal{B}(\mathbb{R})$, $m(\eta(E)) = \int \nu(x, E)\, m(\xi(dx)) = m(\nu_E(\xi))$. We
may also write $m(\Lambda_{\eta(E)}) = m(\nu_E(\xi))$ for every $m$, where $\Lambda_{\eta(E)}$ is the 0-1 observable
associated with $\eta(E)$, which yields $\Lambda_{\eta(E)} = \nu_E(\xi)$. Then
$\eta(E) = \Lambda_{\eta(E)}\{1\} = \xi(\nu_E^{-1}(\{1\})) \in \mathcal{R}(\xi)$. We obtained that $\mathcal{R}(\eta) \subseteq \mathcal{R}(\xi)$,
equivalently, that $\eta = f(\xi)$ for a measurable function $f$. □

4.4. Some examples of minimal observables. 

Example 4.5. Let $L$ be any $\sigma$-orthocomplete effect algebra, and let $a_1, a_2, \ldots, a_n$
be elements of $L$ such that $\bigoplus_{i \le n} a_i = 1$. Choose real numbers $r_1, r_2, \ldots, r_n$. Then we
may construct a (real) observable $\xi$ on $L$ by putting $\xi(E) = \bigoplus_{\{i : r_i \in E\}} a_i$, $E \in \mathcal{B}(\mathbb{R})$.
We clearly have $\xi(\{r_i\}) = a_i$, $i \le n$, and $\{r_1, r_2, \ldots, r_n\}$ is the spectrum of $\xi$.

Now let $L = \mathcal{E}(H)$, where $H$ is a finite dimensional Hilbert space. Let $A_1, A_2, \ldots,
A_n$ be effects in $\mathcal{E}(H)$ such that $\sum_{i \le n} A_i = 1$. That is, $A_1, A_2, \ldots, A_n$ is a resolution
of unity in $\mathcal{E}(H)$. Let $\eta$ be a real observable on $\mathcal{E}(H)$ such that
$\eta(E) = \sum_{\{i : \alpha_i \in E\}} A_i$, $E \in \mathcal{B}(\mathbb{R})$, where $\alpha_i$, $i \le n$, are real numbers. Clearly,
$\eta(\{\alpha_i\}) = A_i$, $i \le n$, and $\{\alpha_i : i \le n\}$ is the spectrum of $\eta$. Notice that every POV
measure $\eta$ on a finite dimensional Hilbert space $H$ is of this type, and $\eta(\{\alpha_i\})$,
$i \le n$, are atoms of the range $\mathcal{R}(\eta)$ of the observable $\eta$.

Since every $A_i$, $i \le n$, is a self-adjoint operator on $H$, it has a spectral
decomposition $A_i = \sum_{j=1}^{k_i} a_{ij} P_{ij}$, where the $P_{ij}$'s are one dimensional projections with
$\sum_{j=1}^{k_i} P_{ij} = 1$, and $0 \le a_{ij} \le 1$ are eigenvalues of $A_i$ (not necessarily all
different). The elements $B_{ij} := a_{ij} P_{ij}$ are effects in $\mathcal{E}(H)$. Owing to $\sum_{i \le n} A_i = 1$, we
have $\sum_{i \le n} \sum_{j=1}^{k_i} B_{ij} = 1$. By the first paragraph, we may choose real numbers
$\{\beta_{ij}\}_{ij}$ and construct an observable $\xi$ such that $\xi(\{\beta_{ij}\}) = B_{ij}$, and more generally,
$\xi(E) = \sum_{\{ij : \beta_{ij} \in E\}} B_{ij}$, $E \in \mathcal{B}(\mathbb{R})$.

Define $f : \mathbb{R} \to \mathbb{R}$ by $f(\beta_{ij}) = \alpha_i$, $i = 1, \ldots, n$, $j = 1, \ldots, k_i$, and $f(r) = 0$
if $r \ne \beta_{ij}$ for all $i, j$. Since $f$ takes finitely many values, each on a Borel set, it is
measurable. Moreover, $f^{-1}(\alpha_i) = \{\beta_{ij} : j = 1, 2, \ldots, k_i\}$ up to a $\xi$-null set. Therefore,
$\eta(\{\alpha_i\}) = A_i = \sum_{j=1}^{k_i} B_{ij} = \sum_{j=1}^{k_i} \xi(\{\beta_{ij}\}) = \xi(f^{-1}(\alpha_i))$. Hence
$\eta = f \circ \xi$, and hence $\eta$ is a smearing of $\xi$.

We have the following conclusion: if the range of a POV observable $\eta$ contains
atoms with rank greater than 1, then there is an observable $\xi$ with $\xi \preceq \eta$. It follows
that $\eta$ is not minimal. The converse statement is also proved in [5]. Now we will
rewrite it in our setting.
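
The refinement step of Example 4.5 can be sketched concretely when all effects are diagonal in one fixed basis, so that the spectral decomposition can be read off from the diagonal. The Python code below is an illustration under that simplifying assumption only; effects are stored as tuples of diagonal entries, and the labelling of the new outcomes $\beta_{ij}$ (here $\alpha_i + j/10$) is a hypothetical choice made for the example.

```python
from fractions import Fraction as F

DIM = 3
ZERO = tuple(F(0) for _ in range(DIM))

def unit(j):
    """Diagonal of the one-dimensional projection onto the j-th basis vector."""
    return tuple(F(int(k == j)) for k in range(DIM))

def add(A, B):
    return tuple(a + b for a, b in zip(A, B))

def scale(c, A):
    return tuple(c * a for a in A)

def refine(povm):
    """Split every effect A_i into rank-one effects B_ij = a_ij * P_ij.

    Returns (xi, f): xi maps each new outcome beta_ij to B_ij, and f maps
    beta_ij back to the original outcome alpha_i. Zero eigenvalues are skipped.
    """
    xi, f = {}, {}
    for alpha, A in povm.items():
        for j, a_ij in enumerate(A):
            if a_ij != 0:
                beta = alpha + F(j + 1, 10)   # hypothetical labelling of outcomes
                xi[beta] = scale(a_ij, unit(j))
                f[beta] = alpha
    return xi, f

def function_of(xi, f):
    """The observable f o xi on the finite spectrum: alpha -> sum of xi over f^{-1}(alpha)."""
    eta = {}
    for beta, B in xi.items():
        eta[f[beta]] = add(eta.get(f[beta], ZERO), B)
    return eta

# eta has two outcomes; the effect at outcome 10 has rank 2, so eta is not minimal.
povm = {F(10): (F(1), F(1, 2), F(0)), F(20): (F(0), F(1, 2), F(1))}
xi, f = refine(povm)

print(len(xi))                       # 4 rank-one effects
print(function_of(xi, f) == povm)    # True: eta = f o xi
```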

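Assume that $\eta$ is a POV measure with the spectrum $\{y_j\}_{j \le n}$ such that $\eta(y_j)$
for every $j \le n$ is an effect of rank one, that is, a multiple of a one-dimensional
projection. Assume that $\eta$ is a smearing of a POV measure $\xi$. This entails, for every $j$,
$$\eta(y_j) = \sum_{i} \nu(x_i, y_j)\, \xi(x_i) = \sum_{i \in \Lambda(j)} \nu(x_i, y_j)\, \xi(x_i),$$
where $i \le m$ for some $m \in \mathbb{N}$ and we put, for every $j$, $\Lambda(j) = \{i : \nu(x_i, y_j) \ne 0\}$.
Observe that for every $i$, $\sum_j \nu(x_i, y_j) = 1$, since $\nu(x_i, \cdot)$ is a probability measure.

Since $\eta(y_j)$ is rank one, we have $\xi(x_i) = \beta_i^j\, \eta(y_j)$ for all $i \in \Lambda(j)$, with $0 \le \beta_i^j \le 1$. This
yields
$$\eta(y_j) = \sum_{i \in \Lambda(j)} \nu(x_i, y_j)\, \beta_i^j\, \eta(y_j). \tag{16}$$

Define $\alpha_i^j := \nu(x_i, y_j)\, \beta_i^j$. Then (16) implies $\sum_{i \in \Lambda(j)} \alpha_i^j = 1$, we have $\alpha_i^j\, \eta(y_j) = \nu(x_i, y_j)\, \xi(x_i)$,
and from
$$\xi(x_i) = \sum_{j} \nu(x_i, y_j)\, \xi(x_i)$$
we get, putting $\mu(y_j, x_i) := \alpha_i^j$ if $i \in \Lambda(j)$ and $\mu(y_j, x_i) := 0$ otherwise,
$$\xi(x_i) = \sum_{j} \mu(y_j, x_i)\, \eta(y_j), \tag{17}$$
which shows that $\xi$ is a smearing of $\eta$. This shows that $\xi \preceq \eta$ implies $\eta \sim \xi$, i.e. $\eta$
is minimal.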
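
The reversal argument above can be checked on a small finite example. The following is a minimal sketch, not part of the original text: the concrete effects, the kernel values and all helper names (`smear`, `scale`, `trace`, ...) are illustrative assumptions, and the coefficient $\beta_i^j$ is computed as a ratio of traces, which is valid only because each $\xi(x_i)$ is assumed to be a multiple of the rank-one effect $\eta(y_j)$.

```python
from fractions import Fraction as Fr

def scale(c, M):
    """Multiply a matrix (nested lists) by a scalar."""
    return [[c * a for a in row] for row in M]

def add(M, N):
    """Entrywise sum of two matrices of the same shape."""
    return [[a + b for a, b in zip(r, s)] for r, s in zip(M, N)]

def trace(M):
    return sum(M[k][k] for k in range(len(M)))

def smear(effects, kernel):
    """Return the effects sum_i kernel[i][j] * effects[i], one per outcome j."""
    n = len(effects[0])
    zero = [[Fr(0)] * n for _ in range(n)]
    out = []
    for j in range(len(kernel[0])):
        acc = zero
        for i, e in enumerate(effects):
            acc = add(acc, scale(kernel[i][j], e))
        out.append(acc)
    return out

# Hypothetical data: two orthogonal one-dimensional projections on C^2.
P0 = [[Fr(1), Fr(0)], [Fr(0), Fr(0)]]
P1 = [[Fr(0), Fr(0)], [Fr(0), Fr(1)]]

# xi has three outcomes x_1, x_2, x_3; nu[i][j] plays the role of nu(x_i, y_j).
xi = [scale(Fr(1, 3), P0), scale(Fr(2, 3), P0), P1]
nu = [[Fr(1), Fr(0)], [Fr(1), Fr(0)], [Fr(0), Fr(1)]]

eta = smear(xi, nu)  # rank-one POV measure with eta(y_1) = P0, eta(y_2) = P1

# Reverse kernel mu[j][i] = alpha_i^j = nu(x_i, y_j) * beta_i^j, where
# beta_i^j = tr xi(x_i) / tr eta(y_j) because xi(x_i) is a multiple of eta(y_j).
mu = [[nu[i][j] * trace(xi[i]) / trace(eta[j]) if nu[i][j] != 0 else Fr(0)
       for i in range(len(xi))]
      for j in range(len(eta))]

xi_back = smear(eta, mu)  # smearing eta by mu recovers xi, as in (17)
```

Exact rational arithmetic is used so that the equalities can be compared without rounding; the rows of `mu` sum to one, which is the condition $\sum_{i \in \Lambda(j)} \alpha_i^j = 1$.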

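Example 4.6. Let $L$ be a $\sigma$-orthocomplete effect algebra, $(\Omega, \mathcal{A})$ a measurable
space. Let $O(\Omega, \mathcal{A}, L)$ denote the set of all observables on $L$ with the value space
$(\Omega, \mathcal{A})$. Let $\mathcal{O} \subseteq O(\Omega, \mathcal{A}, L)$. We say that an observable $\xi$ is minimal in $\mathcal{O}$ (or
$\mathcal{O}$-clean), if for any $\eta \in \mathcal{O}$, the condition $\eta \preceq \xi$ implies that $\eta \sim \xi$.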

Recall that any element $a$ in an effect algebra $L$ defines an observable $\xi^a$ with
the outcome space $\Omega = \{0, 1\}$ by
$$\xi^a(\{1\}) = a, \qquad \xi^a(\{0\}) = a'.$$

Observables of this type are called 1-0 observables.

In [20], minimality in the class $O(\{0, 1\}, H)$ of the 1-0 observables on the effect
algebra $\mathcal{E}(H)$ is considered. In accordance with [20], the 1-0 observable
corresponding to $A \in \mathcal{E}(H)$ will be denoted by $E^A$. We recall that the set $O(\{0, 1\}, H)$
is convex, and
$$\lambda E^A + (1 - \lambda) E^B = E^{\lambda A + (1 - \lambda) B}$$
whenever $\lambda \in [0, 1]$.

Proposition 4.7. [20] Let $A, B \in \mathcal{E}(H)$ and let $E^A, E^B$ be the corresponding 1-0
observables. Then $E^B \preceq E^A$ if and only if there are numbers $s, t \in [0, 1]$ such that
$A = tB + sB'$.

As a consequence of Proposition 4.7 we obtain that $E^A \sim E^B$ iff $A = B$ or
$A = B'$.
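
Proposition 4.7 can be illustrated numerically. The sketch below is an illustration only: it assumes that the confidence measure on $\{0,1\}$ is parametrized by $t = \nu(1, \{1\})$ and $s = \nu(0, \{1\})$, and the effect `B` and the helper names are hypothetical choices.

```python
from fractions import Fraction as Fr

I2 = [[Fr(1), Fr(0)], [Fr(0), Fr(1)]]

def lin(c1, M1, c2, M2):
    """Linear combination c1*M1 + c2*M2 of two matrices (nested lists)."""
    return [[c1 * a + c2 * b for a, b in zip(r1, r2)] for r1, r2 in zip(M1, M2)]

def complement(M):
    """Orthosupplement M' = 1 - M of an effect."""
    return lin(Fr(1), I2, Fr(-1), M)

def smear_one_zero(B, t, s):
    """Smear the 1-0 observable E^B with nu(1,{1}) = t and nu(0,{1}) = s.

    Returns the pair (A, A0) of effects assigned to the outcomes 1 and 0.
    """
    Bc = complement(B)
    A = lin(t, B, s, Bc)            # A = t B + s B'
    A0 = lin(1 - t, B, 1 - s, Bc)   # effect of outcome 0
    return A, A0

# Hypothetical effect on C^2 (diagonal, eigenvalues in [0, 1]).
B = [[Fr(1), Fr(0)], [Fr(0), Fr(1, 4)]]

A, A0 = smear_one_zero(B, Fr(3, 4), Fr(1, 4))
flipped, _ = smear_one_zero(B, Fr(0), Fr(1))   # t = 0, s = 1 gives A = B'
```

The second effect always equals $A'$, so the smeared observable is again a 1-0 observable, and the choices $(t,s) = (1,0)$ and $(0,1)$ reproduce the two cases $A = B$ and $A = B'$.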

Proposition 4.8. [20, Prop. 3] Let $A \in \mathcal{E}(H)$. The observable $E^A$ is minimal in
$O(\{0, 1\}, H)$ if and only if $\|A\| = \|A'\| = 1$.

We note that the extreme elements in the convex set $\mathcal{E}(H)$ are projection
operators [10, Lemma 2.3]. According to Proposition 4.8, for every projection $P$, the
corresponding 1-0 observable $E^P$ is minimal in $O(\{0, 1\}, H)$, but there are also other
minimal observables. However, in accordance with Example 4.5, if $\dim H \ge 3$, then
the observable $E^A$ for any effect $A$ is not minimal in the set of all real observables
on $\mathcal{E}(H)$.

5. Smearings of observables on effect algebras with the (E)-property

Notice that if $S$ is an order determining system of states on an effect algebra $L$,
then by replacing $S$ by its ($\sigma$-)convex hull $\mathrm{Conv}(S)$, we may always assume that $S$
is a ($\sigma$-)convex set.

Let $L$ be a $\sigma$-orthocomplete effect algebra with an order determining system $S$
of $\sigma$-additive states, and let $(X, \mathcal{F})$ be a measurable space. Every observable $\xi : \mathcal{F} \to L$
can be characterized by a mapping $\Phi_\xi : S \to M_1^+(X, \mathcal{F})$ defined by
$$\Phi_\xi(m)(F) = m \circ \xi(F), \qquad m \in S,\ F \in \mathcal{F}. \tag{18}$$

Here $F \mapsto m \circ \xi(F) = \Phi_\xi(m)(F)$, $F \in \mathcal{F}$, is the probability distribution of the
observable $\xi$ in the state $m$.

We now look for conditions under which, for a given observable and a given Markov
kernel, a fuzzy version exists. For the sake of simplicity, we concentrate on real
observables.

The following definitions were introduced in [11]. Let $S$ be a convex set. A
mapping $f : S \times \mathcal{B}(\mathbb{R}) \to [0, 1]$ such that

(i) given $m \in S$, $f(m, \cdot)$ is a probability measure on $\mathcal{B}(\mathbb{R})$,

(ii) for any $E \in \mathcal{B}(\mathbb{R})$, $f(\lambda m_1 + (1 - \lambda) m_2, E) = \lambda f(m_1, E) + (1 - \lambda) f(m_2, E)$
whenever $\lambda \in [0, 1]$ and $m_1, m_2 \in S$,

is said to be a $\sigma$-effect function on $S$.

Definition 5.1. Let $L$ be an effect algebra. We say that a convex system $S$ of
states on $L$ has the (E)-property (E as for existence) if, given a $\sigma$-effect function $f$
on $S$, for any $E \in \mathcal{B}(\mathbb{R})$ there exists an element $\xi(E) \in L$ such that
$$f(m, E) = m(\xi(E)), \qquad m \in S. \tag{E}$$

It was shown in [11] that the set of all $\sigma$-additive states on the effect algebra
$\mathcal{E}(H)$ has the (E)-property.

Theorem 5.2. Let $L$ be a $\sigma$-orthocomplete effect algebra and let $S$ be a convex
order determining system of $\sigma$-additive states on $L$ which has the (E)-property.
Then for every effect function $f$ on $S$, the mapping $E \mapsto \xi(E)$ from $\mathcal{B}(\mathbb{R})$ to $L$ is
an observable on $L$. Moreover, $m \mapsto f(m, \cdot) = \Phi_\xi(m)$.

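Proof. Since $S$ is order determining, the element $\xi(E)$ is uniquely defined by
property (E). Moreover, $\xi(\mathbb{R}) = 1$. Let $(E_i)_{i=1}^{\infty} \subset \mathcal{B}(\mathbb{R})$ be such that $E_i \cap E_j = \emptyset$
whenever $i \ne j$. Then for any $i \ne j$ and any $m \in S$,
$$1 \ge m(\xi(E_i \cup E_j)) = f(m, E_i \cup E_j) = f(m, E_i) + f(m, E_j) = m(\xi(E_i)) + m(\xi(E_j)),$$
which implies that $\xi(E_i) \perp \xi(E_j)$ and $\xi(E_i \cup E_j) = \xi(E_i) \oplus \xi(E_j)$. By induction we
prove that $\xi(\bigcup_{i=1}^{n} E_i) = \bigoplus_{i=1}^{n} \xi(E_i)$. Put $E := \bigcup_{i=1}^{\infty} E_i$. Then for every $m \in S$,
$$m(\xi(E)) = f\Big(m, \bigcup_{i=1}^{\infty} E_i\Big) = \sum_{i=1}^{\infty} f(m, E_i) = \lim_{n \to \infty} \sum_{i=1}^{n} f(m, E_i) = \lim_{n \to \infty} \sum_{i=1}^{n} m(\xi(E_i))$$
$$= \lim_{n \to \infty} m\Big(\bigoplus_{i=1}^{n} \xi(E_i)\Big) = m\Big(\bigoplus_{i=1}^{\infty} \xi(E_i)\Big),$$
where the last equality holds owing to $\sigma$-additivity of $m$. Since $S$ is order
determining, we obtain $\xi(E) = \bigoplus_{i=1}^{\infty} \xi(E_i)$. □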

Theorem 5.3. Let $S$ be a convex order determining system of $\sigma$-additive states
on a $\sigma$-orthocomplete effect algebra $L$ such that $S$ has the (E)-property. Then, given a
real observable $\xi$ on $L$ and a Markov kernel $\nu : \mathbb{R} \times \mathcal{B}(\mathbb{R}) \to [0, 1]$, there is a fuzzy
version $\eta$ of $\xi$.

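Proof. Let $\xi$ and $\nu$ be given. For every $m \in S$ and $E \in \mathcal{B}(\mathbb{R})$, define
$$f(m, E) := \int_{\mathbb{R}} \nu_E(x)\, m(\xi(dx)). \tag{19}$$

We will prove that $f : S \times \mathcal{B}(\mathbb{R}) \to [0, 1]$ is an effect function.

(i): Let $m \in S$ be fixed, and let $(E_i)_{i=1}^{\infty}$ be a sequence of disjoint sets from $\mathcal{B}(\mathbb{R})$. Put $E = \bigcup_{i=1}^{\infty} E_i$.
Then $\nu_E(x) := \nu(x, E) = \sum_{i=1}^{\infty} \nu_{E_i}(x)$, and by additivity of the integral, for every $n \in \mathbb{N}$,
$$\int_{\mathbb{R}} \nu\Big(x, \bigcup_{i=1}^{n} E_i\Big)\, m(\xi(dx)) = \sum_{i=1}^{n} \int_{\mathbb{R}} \nu(x, E_i)\, m(\xi(dx)).$$

Since $\nu_{\bigcup_{i=1}^{n} E_i} \nearrow \nu_E$ pointwise, we have
$$\lim_{n \to \infty} \int_{\mathbb{R}} \nu\Big(x, \bigcup_{i=1}^{n} E_i\Big)\, m(\xi(dx)) = \int_{\mathbb{R}} \nu_E(x)\, m(\xi(dx)),$$
hence $f(m, E) = \sum_{i=1}^{\infty} f(m, E_i)$.

(ii): Let $m = \alpha m_1 + (1 - \alpha) m_2$. Then for every $E \in \mathcal{B}(\mathbb{R})$, $m(\xi(E)) = \alpha m_1(\xi(E)) + (1 - \alpha) m_2(\xi(E))$. Therefore
$$f(m, E) = \int_{\mathbb{R}} \nu_E(x)\, m(\xi(dx)) = \int_{\mathbb{R}} \nu_E(x) \big(\alpha\, m_1(\xi(dx)) + (1 - \alpha)\, m_2(\xi(dx))\big)$$
$$= \alpha \int_{\mathbb{R}} \nu_E(x)\, m_1(\xi(dx)) + (1 - \alpha) \int_{\mathbb{R}} \nu_E(x)\, m_2(\xi(dx)) = \alpha f(m_1, E) + (1 - \alpha) f(m_2, E).$$

This proves that $f$ is an effect function. Then the (E)-property entails that there
is an observable $\eta$ such that for every $m \in S$, $E \in \mathcal{B}(\mathbb{R})$, $m(\eta(E)) = \int_{\mathbb{R}} \nu(x, E)\, m(\xi(dx))$,
that is, $\eta$ is a fuzzy version of $\xi$. □

6. Stochastic operators and Markov kernels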

We begin by introducing some notation. Let $M(\Omega, \mathcal{A})$ denote the vector
space of all complex measures on $(\Omega, \mathcal{A})$. Then $M(\Omega, \mathcal{A})$ is a Banach space with
the total variation norm $\|\mu\| = |\mu|(\Omega)$.

Let $\mu$ be a $\sigma$-finite measure on $(\Omega, \mathcal{A})$. We denote by $L(\mu)$ the subspace in $M(\Omega, \mathcal{A})$
generated by all $P \in M_1^+(\Omega, \mathcal{A})$ such that $P \ll \mu$. The space $L(\mu)$ can be identified
with $L_1(\Omega, \mathcal{A}, \mu)$, by extension of the map $P \mapsto dP/d\mu$. If $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$, then we
denote by $L(\mathcal{P})$ the subspace generated by $\bigcup_{P \in \mathcal{P}} L(P)$.

A stochastic operator is an affine map
$$T : \mathcal{M} \to M_1^+(\Omega_1, \mathcal{A}_1),$$
where $\mathcal{M}$ is a convex subset in $M_1^+(\Omega, \mathcal{A})$. Any stochastic operator can be
extended to a positive, norm-preserving map from the Banach subspace in $M(\Omega, \mathcal{A})$
generated by $\mathcal{M}$, to $M(\Omega_1, \mathcal{A}_1)$.

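Example 6.1. 1. Let $\mathcal{A}_1 \subset \mathcal{A}$ be a sub-$\sigma$-algebra. Then the restriction map
$$T_{\mathcal{A}_1} : M_1^+(\Omega, \mathcal{A}) \to M_1^+(\Omega, \mathcal{A}_1), \qquad P \mapsto P|_{\mathcal{A}_1}$$
is a stochastic operator.

2. Let $F : (\Omega, \mathcal{A}) \to (\Omega_1, \mathcal{A}_1)$ be a measurable map. Then $F$ defines the
stochastic operator
$$T_F : M_1^+(\Omega, \mathcal{A}) \to M_1^+(\Omega_1, \mathcal{A}_1), \qquad P \mapsto P^F,$$
where $P^F$ is the distribution of $F$ under $P$, that is,
$$P^F(B) = P(F^{-1}(B)), \qquad B \in \mathcal{A}_1.$$

3. Let $\nu : \Omega \times \mathcal{A}_1 \to \mathbb{R}$ be a weak Markov kernel with respect to $\mathcal{P}$. Then
$T_\nu : P \mapsto \nu(P)$ defines a stochastic operator $T_\nu : L(\mathcal{P}) \to M(\Omega_1, \mathcal{A}_1)$.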

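A stochastic operator is called a statistical map if there is a Markov kernel $\nu$ such
that $T = T_\nu$. Note that the operators $T_{\mathcal{A}_1}$ and $T_F$ in Example 6.1 are given by Markov
kernels. Indeed, if we put
$$\nu_{\mathcal{A}_1}(\omega, B) = \chi_B(\omega), \qquad \nu_F(\omega, B) = \chi_{F^{-1}(B)}(\omega), \tag{20}$$
then $T_{\mathcal{A}_1} = T_{\nu_{\mathcal{A}_1}}$, and $T_F = T_{\nu_F}$.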
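
On a finite sample space the identity $T_F = T_{\nu_F}$ is a one-line computation. The following sketch is an illustration under assumptions of our own: the sample space $\{0, \dots, 5\}$, the map $F(\omega) = \omega \bmod 3$, the probability `P` and all function names are hypothetical.

```python
from fractions import Fraction as Fr

def apply_kernel(nu, P, B):
    """nu(P)(B) = sum over omega of nu(omega, B) * P({omega})."""
    return sum(nu(w, B) * p for w, p in P.items())

def kernel_of_map(F):
    """The kernel nu_F(omega, B): indicator of F^{-1}(B) evaluated at omega."""
    return lambda w, B: 1 if F(w) in B else 0

def pushforward(F, P, B):
    """P^F(B) = P(F^{-1}(B)), computed directly from the preimage."""
    return sum(p for w, p in P.items() if F(w) in B)

# Hypothetical probability on {0, ..., 5}; the weights sum to 1.
P = {0: Fr(1, 3), 1: Fr(0), 2: Fr(1, 6), 3: Fr(1, 6), 4: Fr(1, 6), 5: Fr(1, 6)}

def F(w):
    return w % 3

nu_F = kernel_of_map(F)
events = [set(), {0}, {1}, {2}, {0, 2}, {0, 1, 2}]
agree = all(apply_kernel(nu_F, P, B) == pushforward(F, P, B) for B in events)
```

The kernel $\nu_F$ takes only the values 0 and 1, which is what distinguishes a sharp (deterministic) post-processing from a genuinely fuzzy one.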

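Proposition 6.2. [H [3] A stochastic operator $T : M_1^+(\Omega, \mathcal{A}) \to M_1^+(\Omega_1, \mathcal{A}_1)$ is a
statistical map if and only if for every $B \in \mathcal{A}_1$ there is an $\mathcal{A}$-measurable function
$f_B : \Omega \to [0, 1]$ such that
$$\int_B (TP)(d\omega_1) = \int_\Omega f_B(\omega)\, P(d\omega) \tag{21}$$
for every $P \in M_1^+(\Omega, \mathcal{A})$.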

Proof. If there is a Markov kernel $\nu$ such that $T = T_\nu$, we put $f_B(\omega) = \nu(\omega, B)$.

Conversely, $\nu(\omega, B) := f_B(\omega)$ is a Markov kernel. Indeed, for every $\omega^* \in \Omega$ let
$\delta_{\omega^*}$ denote the corresponding Dirac measure. Equation (21) implies that $T\delta_{\omega^*}(B) =
\int_\Omega f_B(\omega)\, \delta_{\omega^*}(d\omega) = f_B(\omega^*)$, which immediately implies the desired result. □

Let $\xi : (X, \mathcal{F}) \to L$ and $\eta : (Y, \mathcal{G}) \to L$ be observables such that $\xi \preceq \eta$, and let
$\nu : X \times \mathcal{G} \to [0, 1]$ be the corresponding confidence measure. Then the equation
$$m(\eta(G)) = \int_X \nu(x, G)\, m(\xi(dx)), \qquad G \in \mathcal{G},\ m \in S \tag{22}$$
can be rewritten in the form
$$\Phi_\eta = T_\nu \circ \Phi_\xi, \tag{23}$$
where $T_\nu$ is the statistical map corresponding to $\nu$, and $\Phi_\xi$ is defined by (18).
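
For finitely many outcomes, (22) and (23) say that smearing the effects and then taking probabilities gives the same result as taking probabilities first and then applying the kernel as a stochastic matrix. The sketch below illustrates this on a qubit; the density matrix `rho`, the kernel `nu` and the function names are hypothetical, and states are assumed to be of the form $m(A) = \mathrm{tr}(\rho A)$.

```python
from fractions import Fraction as Fr

def tr_prod(R, M):
    """trace(R M) for square matrices given as nested lists."""
    n = len(R)
    return sum(R[a][b] * M[b][a] for a in range(n) for b in range(n))

def distribution(rho, effects):
    """Phi_xi(m): the outcome probabilities tr(rho * xi_i)."""
    return [tr_prod(rho, E) for E in effects]

def push(kernel, dist):
    """T_nu applied to a distribution: q_j = sum_i kernel[i][j] * p_i."""
    return [sum(kernel[i][j] * dist[i] for i in range(len(dist)))
            for j in range(len(kernel[0]))]

def smear(effects, kernel):
    """Effects of the fuzzy version: eta_j = sum_i kernel[i][j] * xi_i."""
    n = len(effects[0])
    return [[[sum(kernel[i][j] * effects[i][a][b] for i in range(len(effects)))
              for b in range(n)] for a in range(n)]
            for j in range(len(kernel[0]))]

# Hypothetical sharp qubit observable, confidence measure and state.
xi = [[[Fr(1), Fr(0)], [Fr(0), Fr(0)]],
      [[Fr(0), Fr(0)], [Fr(0), Fr(1)]]]
nu = [[Fr(9, 10), Fr(1, 10)],
      [Fr(1, 5), Fr(4, 5)]]
rho = [[Fr(3, 4), Fr(1, 4)],
       [Fr(1, 4), Fr(1, 4)]]

eta = smear(xi, nu)
lhs = distribution(rho, eta)            # Phi_eta(m)
rhs = push(nu, distribution(rho, xi))   # (T_nu o Phi_xi)(m)
```

Both sides evaluate to the distribution $(29/40,\ 11/40)$ for this choice of data.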

In general, there is little hope for a stochastic operator $T : M(\Omega, \mathcal{A}) \to
M(\Omega_1, \mathcal{A}_1)$ to be given by a Markov kernel (see e.g. [4]). However, for stochastic
operators defined on $L(\mu)$ this is often the case.

Let us recall that the measurable space $(X, \mathcal{B})$ is a standard Borel space if $X$ is a
complete separable metrizable (Polish) space and $\mathcal{B}$ is the Borel $\sigma$-algebra over $X$.

Theorem 6.3. [30] Let $\mu$ be a $\sigma$-finite measure on $(\Omega, \mathcal{A})$ and $(X, \mathcal{B})$ be a standard
Borel space. Let $T : L(\mu) \to M(X, \mathcal{B})$ be a stochastic operator. Then there is a
Markov kernel $\nu : \Omega \times \mathcal{B} \to [0, 1]$ such that $T = T_\nu | L(\mu)$.

Let $\xi$ be an $(\Omega, \mathcal{A})$-observable on $L$, and let $m_0$ be a faithful $\sigma$-additive state on
$L$. Then for every $m \in S$, we have $m \circ \xi \ll m_0 \circ \xi$. By Theorem 6.3, to every
stochastic operator
$$T : L(m_0 \circ \xi) \to M(X, \mathcal{B}),$$
where $(X, \mathcal{B})$ is a standard Borel space, there is a Markov kernel $\nu : \Omega \times \mathcal{B} \to [0, 1]$
such that $T = T_\nu | L(m_0 \circ \xi)$.

6.1. Coarse graining. We keep our assumption that $L$ is a $\sigma$-complete effect
algebra and $S$ is an order determining system of $\sigma$-additive states on $L$.

Definition 6.4. Let $\xi$ and $\eta$ be observables on $L$ with value spaces $(X, \mathcal{F})$ and
$(Y, \mathcal{G})$, respectively. We say that $\eta$ is a coarse graining of $\xi$ if there is a stochastic
operator $T : M_1^+(X, \mathcal{F}) \to M_1^+(Y, \mathcal{G})$ such that $\Phi_\eta = T \circ \Phi_\xi$, where $\Phi_\xi$ is defined
by (18).

Our previous discussion shows that if $\xi \preceq \eta$, then $\eta$ is a coarse graining of $\xi$. If
$\eta$ is a coarse graining of $\xi$, and the conditions of Theorem 6.3 are satisfied, then $\xi \preceq \eta$
holds.

In particular, if $L = \mathcal{E}(H)$ ($H$ separable), then for any observable $\xi : (\Omega, \mathcal{A}) \to L$
and any faithful state $m_0$, every stochastic operator $T : L(m_0 \circ \xi) \to M(X, \mathcal{B})$ (where
$(X, \mathcal{B})$ is a standard Borel space) defines a Markov kernel $\nu$, and hence a fuzzy
version $\eta$ of $\xi$ with the confidence measure $\nu$. If $(X, \mathcal{B}) = (\mathbb{R}, \mathcal{B}(\mathbb{R}))$, then $\eta$ is a
real observable.

7. Application of the classical theory of experiments to quantum observables

Representations of observables by probability distributions enable us to apply 
some results from the classical theory of experiments to quantum observables. 

A (statistical) experiment (or model) is a triple $\mathcal{X} = (\Omega, \mathcal{A}, \mathcal{P})$, where $(\Omega, \mathcal{A})$ is
a measurable space and $\mathcal{P}$ is a nonempty family of measures in $M_1^+(\Omega, \mathcal{A})$. $(\Omega, \mathcal{A})$
is called the sample space of the experiment $\mathcal{X}$.

7.1. f-divergence. Let $P, Q \in M_1^+(\Omega, \mathcal{A})$. Recall that a Lebesgue decomposition
of $P$ with respect to $Q$ is any pair $(f, N)$ such that $f : \Omega \to \mathbb{R}$ is measurable, $f \ge 0$,
$N \in \mathcal{A}$, $Q(N) = 0$, and
$$P(A) = \int_A f\, dQ + P(A \cap N), \qquad A \in \mathcal{A}.$$

The function $f$ is called the likelihood ratio of $P$ with respect to $Q$ and denoted by
$f = dP/dQ$. For example, if $P \ll Q$ and $(dP/dQ, N)$ is a Lebesgue decomposition,
then $P(N) = 0$ and $dP/dQ$ is the Radon-Nikodym derivative. If both $P$ and $Q$ are
dominated by a $\sigma$-finite measure $\mu$ and $p = dP/d\mu$, $q = dQ/d\mu$, $N = \{q = 0\}$, then
$(p/q, N)$ is a Lebesgue decomposition of $P$ with respect to $Q$.

Let $P, Q \in M_1^+(\Omega, \mathcal{A})$ and let $(dP/dQ, N)$ be a Lebesgue decomposition. Let
$f : [0, \infty) \to \mathbb{R}$ be a convex function. We define the f-divergence of $P$ with respect
to $Q$ by [22]
$$D_f(P, Q) = \int_\Omega f\Big(\frac{dP}{dQ}\Big)\, dQ + f_\infty\, P(N), \tag{24}$$
where $f_\infty = \lim_{u \to \infty} f(u)/u$. It can be seen that $D_f$ does not depend on the choice
of the Lebesgue decomposition.

The $f$-divergence of $P$ with respect to $Q$ is a generalization of the total variation
of $P - Q$. Choosing $f(u) := |u - 1|$ for all $u \in \mathbb{R}^+$ we obtain $D_f(P, Q) =
\|P - Q\|$. More generally, if $f$ is a strictly convex function satisfying $f(1) = 0$,
then $D_f(P, Q) \ge 0$ for all $P, Q$ and $D_f(P, Q) = 0$ if and only if $P = Q$. In this
sense, $D_f$ can be seen as a quasi-distance in $M_1^+(\Omega, \mathcal{A})$.

For example, if $f(x) = -\log(x)$, then $D_f(P, Q)$ is the well-known $I$-divergence
(Kullback-Leibler divergence, relative entropy)
$$I(P, Q) = \int_\Omega (\log q - \log p)\, q\, d\mu,$$
where $\mu$ is a dominating measure and $p = dP/d\mu$, $q = dQ/d\mu$.
Another example is the Hellinger distance
$$H(P, Q) = \int_\Omega (p^{1/2} - q^{1/2})^2\, d\mu$$
obtained by the choice $f(x) = f_H(x) = (1 - x^{1/2})^2$. For more examples and facts
about $f$-divergences, see [2l)].
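
For distributions on a finite set, with the counting measure as dominating measure, definition (24) reduces to a finite sum plus a singular term on $N = \{q = 0\}$. The sketch below is an illustration only; the function names and the test distributions are hypothetical, and the value $f_\infty$ is passed by hand rather than computed.

```python
import math

def f_divergence(f, f_inf, P, Q):
    """D_f(P,Q) = sum_{q>0} f(p/q) q + f_inf * P({q = 0}) on a finite set."""
    regular = sum(f(p / q) * q for p, q in zip(P, Q) if q > 0)
    singular = sum(p for p, q in zip(P, Q) if q == 0)
    # The product is skipped when P(N) = 0, so an infinite f_inf is harmless.
    return regular + (f_inf * singular if singular > 0 else 0.0)

# Pairs (f, f_inf) with f_inf = lim f(u)/u as u tends to infinity.
total_variation = (lambda u: abs(u - 1.0), 1.0)
hellinger = (lambda u: (1.0 - math.sqrt(u)) ** 2, 1.0)
neg_log = (lambda u: -math.log(u) if u > 0 else math.inf, 0.0)

P = [0.5, 0.5]
Q = [0.25, 0.75]
R = [1.0, 0.0]          # Q-type measure that does not dominate P

tv_PQ = f_divergence(*total_variation, P, Q)   # equals sum |p - q|
tv_PR = f_divergence(*total_variation, P, R)   # uses the singular term
h_PQ = f_divergence(*hellinger, P, Q)
kl_PQ = f_divergence(*neg_log, P, Q)           # sum q (log q - log p)
```

The case `tv_PR` shows why the term $f_\infty P(N)$ is needed: without it the total variation between $P$ and a measure that does not dominate it would be underestimated.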

Let $\xi \preceq \eta$. The relation (9) implies that if for two states $m_1, m_2$ we have
$m_1 \circ \xi = m_2 \circ \xi$, then also $m_1 \circ \eta = m_2 \circ \eta$ holds. That is, the discerning power of
$\xi$ with respect to states is greater than that of $\eta$. A strengthening of this result is
given by the following monotonicity theorem. In [30], [22], it is proved for Markov
kernels, but the proof works also for weak Markov kernels.

Theorem 7.1. Let $P, Q \in M_1^+(\Omega, \mathcal{A})$ and let $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ be a weak Markov
kernel with respect to $\{P, Q\}$. Let $f : [0, \infty) \to \mathbb{R}$ be a convex function. Then
$$D_f(P, Q) \ge D_f(\nu(P), \nu(Q)).$$

7.2. Sufficient Markov kernels. Let $(\Omega, \mathcal{A}, \mathcal{P})$ be an experiment, $(\Omega_1, \mathcal{A}_1)$ a
measurable space and $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ a Markov kernel. According to (12), $\nu$ assigns
to every $P \in \mathcal{P}$ a measure $\nu(P) \in M_1^+(\Omega_1, \mathcal{A}_1)$ by
$$\nu(P)(A_1) = \int_\Omega \nu(x, A_1)\, P(dx), \qquad A_1 \in \mathcal{A}_1.$$

Note that for $P, Q \in M_1^+(\Omega, \mathcal{A})$, $Q \ll P$ implies $\nu(Q) \ll \nu(P)$. Indeed, if $B \in \mathcal{A}_1$
is such that $\nu(P)(B) = \int \nu(\omega, B)\, dP(\omega) = 0$, then, since $\nu(\omega, B) \ge 0$, we must have
$\nu(\omega, B) = 0$, $P$ a.e. But then also $\int \nu(\omega, B)\, Q(d\omega) = \nu(Q)(B) = 0$.

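For a measurable function $f : (\Omega, \mathcal{A}) \to [0, 1]$ and $P \in M_1^+(\Omega, \mathcal{A})$, we define the
measure $f \cdot P$ as
$$(f \cdot P)(A) := \int_A f\, dP, \qquad A \in \mathcal{A}.$$

Then clearly $f \cdot P \ll P$, hence $\nu(f \cdot P) \ll \nu(P)$. Let us define
$$E_P^\nu(f) := d\nu(f \cdot P) / d\nu(P).$$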

Definition 7.2. [22, Definition 22.1] Let $(\Omega, \mathcal{A}, \mathcal{P})$ be an experiment, $(\Omega_1, \mathcal{A}_1)$ a
measurable space and $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ a Markov kernel.

(a) $\nu$ is called Blackwell sufficient (for $\mathcal{P}$) if there exists a kernel $\nu' : \Omega_1 \times \mathcal{A} \to
[0, 1]$ such that $\nu'(\nu(P)) = P$ holds for all $P \in \mathcal{P}$.

(b) $\nu$ is said to be sufficient (for $\mathcal{P}$) if to every $A \in \mathcal{A}$ there exists a measurable
function $q_A : (\Omega_1, \mathcal{A}_1) \to \mathbb{R}$ such that
$$E_P^\nu(\chi_A) = q_A, \qquad \nu(P) \text{ a.e. for all } P \in \mathcal{P}. \tag{25}$$

Clearly, for observables $\xi, \eta$ on $L$ such that $\xi \preceq \eta$, we have $\xi \sim \eta$ iff the
corresponding Markov kernel $\nu$ is Blackwell sufficient for the measures $m \circ \xi$, $m \in S$.

Remark 7.3. In classical statistics, sufficiency of a sub-$\sigma$-algebra $\mathcal{A}_1$ (or a statistic)
for an experiment means the existence of common versions of the conditional
probabilities $P(A/\mathcal{A}_1)$ for all $P \in \mathcal{P}$. The above definition is a generalization of this
well-known notion: if $\mathcal{A}_1 \subset \mathcal{A}$ is a sub-$\sigma$-algebra and $\nu = \nu_{\mathcal{A}_1}$, then $E_P^\nu(\chi_A) = P(A/\mathcal{A}_1)$.

Remark 7.4. Let us fix $P \in M_1^+(\Omega, \mathcal{A})$. Let us define a map $\nu'_P : \Omega_1 \times \mathcal{A} \to \mathbb{R}$ by
$$\nu'_P(\omega_1, A) := E_P^\nu(\chi_A)(\omega_1).$$

Then we have
$$\int_A \nu(\omega, B)\, P(d\omega) = \int_B \nu'_P(\omega_1, A)\, \nu(P)(d\omega_1), \qquad A \in \mathcal{A},\ B \in \mathcal{A}_1. \tag{26}$$

We prove that $\nu'_P$ is a weak Markov kernel with respect to $\nu(P)$.

By definition, we know that $\omega_1 \mapsto \nu'_P(\omega_1, A)$ is measurable for all $A \in \mathcal{A}$. This
shows (i). Moreover, $\nu'_P(\omega_1, A) \ge 0$, $\nu(P)$ a.e. Further, let $B = \{\omega_1 : \nu'_P(\omega_1, A) >
1\}$ and suppose that $\nu(P)(B) > 0$. Then by (26),
$$\nu(P)(B) < \int_B \nu'_P(\omega_1, A)\, \nu(P)(d\omega_1) = \int_A \nu(\omega, B)\, P(d\omega) \le \nu(P)(B).$$

It follows that $\nu(P)(B) = 0$, whence $\nu'_P(\omega_1, A) \le 1$, $\nu(P)$ a.e., and (ii) is shown.
We have
$$1 = P(\Omega) = \int_\Omega \nu(\omega, \Omega_1)\, P(d\omega) = \int_{\Omega_1} \nu'_P(\omega_1, \Omega)\, \nu(P)(d\omega_1),$$
hence $\nu'_P(\omega_1, \Omega) = 1$ a.e. $\nu(P)$.
Similarly we show that $\nu'_P(\omega_1, \emptyset) = 0$ a.e. $\nu(P)$, which proves (iii). Finally, let $\{A_n\}$
be a sequence in $\mathcal{A}$ such that $A_n \cap A_m = \emptyset$ if $n \ne m$. Then for $B \in \mathcal{A}_1$,
$$\int_B \nu'_P\Big(\omega_1, \bigcup_n A_n\Big)\, \nu(P)(d\omega_1) = \int_{\bigcup_n A_n} \nu(\omega, B)\, P(d\omega) = \sum_n \int_{A_n} \nu(\omega, B)\, P(d\omega)$$
$$= \int_B \sum_n \nu'_P(\omega_1, A_n)\, \nu(P)(d\omega_1),$$
which proves (iv), so that $\nu'_P$ is indeed a weak Markov kernel with respect to $\nu(P)$.
By (26),
$$\nu'_P(\nu(P))(A) = \int_{\Omega_1} \nu'_P(\omega_1, A)\, \nu(P)(d\omega_1) = \int_A \nu(\omega, \Omega_1)\, P(d\omega) = P(A).$$

We see that sufficiency of the kernel $\nu$, in contrast with Blackwell sufficiency,
implies the existence of a weak Markov kernel $\nu'$ such that $\nu'(\nu(P)) = P$ holds for
$P \in \mathcal{P}$.
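
On finite sample spaces the kernel $\nu'_P$ of Remark 7.4 is given by Bayes' rule, and the identity $\nu'_P(\nu(P)) = P$ can be verified directly. The sketch below is an illustration under assumptions of our own: the kernel `nu`, the probabilities `P` and `P2` and the function names are hypothetical, and on $\nu(P)$-null points the reverse kernel is set to the uniform distribution, an arbitrary choice that reflects the "a.e." nature of a weak Markov kernel.

```python
from fractions import Fraction as Fr

def push(nu, P):
    """nu(P)({w1}) = sum_w nu[w][w1] * P[w]."""
    return [sum(nu[w][w1] * P[w] for w in range(len(P)))
            for w1 in range(len(nu[0]))]

def reverse_kernel(nu, P):
    """nu'_P[w1][w] = nu[w][w1] * P[w] / nu(P)[w1] (Bayes' rule).

    Rows belonging to nu(P)-null points w1 are filled with the uniform
    distribution; any other choice would do as well.
    """
    Q = push(nu, P)
    n = len(P)
    return [[nu[w][w1] * P[w] / Q[w1] for w in range(n)] if Q[w1] != 0
            else [Fr(1, n)] * n
            for w1 in range(len(Q))]

# Hypothetical kernel from a 3-point space to a 2-point space.
nu = [[Fr(1), Fr(0)],
      [Fr(1, 2), Fr(1, 2)],
      [Fr(0), Fr(1)]]
P = [Fr(1, 2), Fr(1, 4), Fr(1, 4)]
P2 = [Fr(1, 4), Fr(1, 4), Fr(1, 2)]

rev = reverse_kernel(nu, P)
recovered = push(rev, push(nu, P))      # nu'_P(nu(P))
rev2 = reverse_kernel(nu, P2)
```

Here `rev` and `rev2` differ although $\nu(P)$ and $\nu(P_2)$ have full support, so no common version exists and this particular $\nu$ is not sufficient for the pair $\{P, P_2\}$.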

7.3. Pairwise sufficiency. We say that a subalgebra (Markov kernel) is pairwise
sufficient for $\mathcal{P}$ if it is sufficient for any pair $\{P_1, P_2\}$, $P_1, P_2 \in \mathcal{P}$. Clearly, a
sufficient subalgebra (Markov kernel) is pairwise sufficient. We have the following
characterization of pairwise sufficient Markov kernels.

Theorem 7.5. [MIE (S. Kullback, R.A. Leibler, I. Csiszár). Let $(\Omega, \mathcal{A})$, $(\Omega_1, \mathcal{A}_1)$
be measurable spaces, $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ a Markov kernel and $P, Q \in M_1^+(\Omega, \mathcal{A})$.
Then the following are equivalent.

(i) $\nu$ is sufficient for $\{P, Q\}$.

(ii) For any convex function $f$ on $\mathbb{R}^+$ one has
$$D_f(\nu(P), \nu(Q)) = D_f(P, Q). \tag{27}$$

(iii) There is a strictly convex function $f$ on $\mathbb{R}^+$ such that
$$D_f(\nu(P), \nu(Q)) = D_f(P, Q) < \infty. \tag{28}$$

Note that we may take the Hellinger distance $H(P, Q)$ in (iii).
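
The monotonicity of Theorem 7.1 and the equality criterion of Theorem 7.5 (iii) can be observed on a three-point example with the Hellinger distance. In the sketch below, which is only an illustration, the distributions and the two deterministic kernels are hypothetical: `merge_01` identifies two points with the same likelihood ratio $p/q$, while `merge_12` identifies points with different ratios.

```python
import math

def hellinger(P, Q):
    """H(P,Q) = sum (sqrt(p) - sqrt(q))^2 with respect to counting measure."""
    return sum((math.sqrt(p) - math.sqrt(q)) ** 2 for p, q in zip(P, Q))

def push(nu, P):
    """nu(P)({w1}) = sum_w nu[w][w1] * P[w]."""
    return [sum(nu[w][w1] * P[w] for w in range(len(P)))
            for w1 in range(len(nu[0]))]

# Hypothetical pair of distributions; points 0 and 1 share the ratio p/q = 2.
P = [0.2, 0.2, 0.6]
Q = [0.1, 0.1, 0.8]

merge_01 = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]   # keeps the likelihood ratio
merge_12 = [[1.0, 0.0], [0.0, 1.0], [0.0, 1.0]]   # mixes different ratios

h_full = hellinger(P, Q)
h_01 = hellinger(push(merge_01, P), push(merge_01, Q))
h_12 = hellinger(push(merge_12, P), push(merge_12, Q))
```

Up to rounding, `h_01` equals `h_full`, consistent with `merge_01` being sufficient for $\{P, Q\}$, whereas `h_12` is strictly smaller, so `merge_12` loses information about the pair.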

7.4. Dominated families. Let $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$. We say that $\mathcal{P}$ is a dominated
family if there is a $\sigma$-finite measure $\mu$ such that $\mathcal{P} \ll \mu$. If this is the case, then
we can find a finite measure $\mu_0$ dominating $\mathcal{P}$. It is clear that if $\mathcal{P} \ll \mu$, then we
have $C(\mathcal{P}) \ll \mu$, where
$$C(\mathcal{P}) = \Big\{ \sum_n \lambda_n P_n : \lambda_n \ge 0,\ \sum_n \lambda_n = 1,\ P_n \in \mathcal{P} \Big\}.$$

If we also have $\mu(A) = 0$ whenever $P(A) = 0$ for all $P \in \mathcal{P}$, then we write $\mathcal{P} \sim \mu$.

Lemma 7.6. [19] Let $\mathcal{P}$ be a dominated family. Then there is a convex combination
$P_0 = \sum_n \lambda_n P_n$ of elements $P_n \in \mathcal{P}$, $n \in \mathbb{N}$, such that $\mathcal{P} \sim P_0$.

The following theorem is well known. 

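Theorem 7.7. [HE [30] Let $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$ be a dominated family and let $P_0 \in
C(\mathcal{P})$ be such that $\mathcal{P} \sim P_0$. Let $\mathcal{A}_1 \subset \mathcal{A}$ be a sub-$\sigma$-algebra. Then the following are
equivalent.

(i) $\mathcal{A}_1$ is sufficient for $\mathcal{P}$.

(ii) $\mathcal{A}_1$ is pairwise sufficient for $\mathcal{P}$.

(iii) $\mathcal{A}_1$ is sufficient for the pair $\{P, P_0\}$ for every $P \in \mathcal{P}$.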

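A similar statement holds also for Markov kernels. Since the proof is not easy
to find in the literature, we give it here. We will first show that we can describe
sufficient Markov kernels in terms of sufficient subalgebras.

Let $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ be a weak Markov kernel with respect to $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$.
For $P \in L(\mathcal{P}) \cap M_1^+(\Omega, \mathcal{A})$ we define a probability measure $P \times \nu \in M_1^+(\Omega \times \Omega_1, \mathcal{A} \otimes \mathcal{A}_1)$ by
$$P \times \nu(A \times B) = \int_A \nu(\omega, B)\, P(d\omega), \qquad A \in \mathcal{A},\ B \in \mathcal{A}_1.$$

Note that we have $P \times \nu(\Omega \times B) = \nu(P)(B)$ and $P \times \nu(A \times \Omega_1) = P(A)$ for $A \in \mathcal{A}$,
$B \in \mathcal{A}_1$.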

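Lemma 7.8. [H] Let $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$ and let $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ be a Markov kernel.
Then $\nu$ is sufficient for $\mathcal{P}$ if and only if the sub-$\sigma$-algebra $\mathcal{A}_0 = \{\emptyset, \Omega\} \otimes \mathcal{A}_1 \subset \mathcal{A} \otimes \mathcal{A}_1$
is sufficient for $\{P \times \nu : P \in \mathcal{P}\}$.

Proof. Let $\nu$ be sufficient and let $\nu'$ be the corresponding weak Markov kernel, see
Remark 7.4. For $A \in \mathcal{A}$, $B \in \mathcal{A}_1$, we define a function $f_{A \times B} : \Omega \times \Omega_1 \to [0, 1]$ by
$$f_{A \times B}(\omega, \omega_1) := \chi_B(\omega_1)\, \nu'(\omega_1, A).$$

It is clear that $f_{A \times B}$ is $\mathcal{A}_0$-measurable; moreover, for $B_1 \in \mathcal{A}_1$ and $P \in \mathcal{P}$,
$$\int_{\Omega \times B_1} f_{A \times B}\, d(P \times \nu) = \int_{B \cap B_1} \nu'(\omega_1, A)\, \nu(P)(d\omega_1) = \int_A \nu(\omega, B \cap B_1)\, P(d\omega) = P \times \nu(A \times B \cap \Omega \times B_1). \tag{29}$$

It follows that $f_{A \times B}$ is the common version of the conditional probability $f_{A \times B} =
P \times \nu(A \times B / \mathcal{A}_0)$, $P \times \nu$ a.e., for all $P \in \mathcal{P}$.

Conversely, suppose that $\mathcal{A}_0$ is sufficient for $\{P \times \nu : P \in \mathcal{P}\}$ and let $f_{A \times B} =
P \times \nu(A \times B / \mathcal{A}_0)$, $P \times \nu$ a.e. for all $P \in \mathcal{P}$. Then, since $f_{A \times B}$ is $\mathcal{A}_0$-measurable,
it depends only on $\omega_1$. Put $\nu'(\omega_1, A) = f_{A \times \Omega_1}(\omega_1)$. Then for $B \in \mathcal{A}_1$ and $P \in \mathcal{P}$,
$$\int_B \nu'(\omega_1, A)\, \nu(P)(d\omega_1) = \int_{\Omega \times B} f_{A \times \Omega_1}\, d(P \times \nu) = P \times \nu(A \times B) = \int_A \nu(\omega, B)\, P(d\omega),$$
so that $\nu' = \nu'_P$, $\nu(P)$-a.e. □

Theorem 7.9. Let $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$ be dominated and let $P_0 \in C(\mathcal{P})$ be such that
$\mathcal{P} \sim P_0$. Let $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ be a Markov kernel. Then $\nu$ is sufficient for $\mathcal{P}$ if
and only if $\nu$ is sufficient for $\{P, P_0\}$ for every $P \in \mathcal{P}$.

Proof. Let us denote
$$\mathcal{P} \times \nu := \{P \times \nu : P \in \mathcal{P}\}.$$

By Lemma 7.8, $\nu$ is sufficient for all $\{P, P_0\}$ if and only if the sub-$\sigma$-algebra $\mathcal{A}_0$
is sufficient for all $\{P \times \nu, P_0 \times \nu\}$. Clearly, $P_0 \times \nu \in C(\mathcal{P} \times \nu)$, and if we have
$\mathcal{P} \times \nu \sim P_0 \times \nu$, then, by Theorem 7.7, $\mathcal{A}_0$ is sufficient for $\mathcal{P} \times \nu$ and therefore $\nu$ is
sufficient for $\mathcal{P}$. It is enough to prove that $\{P \times \nu : P \in \mathcal{P}\}$ is dominated by $P_0 \times \nu$.
For this, fix $\varepsilon > 0$ and let $A \in \mathcal{A}$, $B \in \mathcal{A}_1$. By the Kolmogorov inequality,
$$P_0 \times \nu(A \times B) = \int_A \nu(\omega, B)\, P_0(d\omega) \ge k\, P_0(A \cap \{\nu(\omega, B) > k\}) \tag{30}$$
for all $k > 0$; moreover, since $\nu(\omega, B) \le 1$,
$$P \times \nu(A \times B) = \int_A \nu(\omega, B)\, P(d\omega) = \int_{A \cap \{\nu(\omega, B) > k\}} \nu(\omega, B)\, P(d\omega) + \int_{A \cap \{\nu(\omega, B) \le k\}} \nu(\omega, B)\, P(d\omega)$$
$$\le P(A \cap \{\nu(\omega, B) > k\}) + k.$$

Since $P \ll P_0$, there is some $\delta > 0$ such that $P(A) < \varepsilon/2$ if $P_0(A) < 2\delta/\varepsilon$. Put
$k = \varepsilon/2$ in (30). Then $P_0 \times \nu(A \times B) < \delta$ implies that $P_0(A \cap \{\nu(\omega, B) > \varepsilon/2\}) < 2\delta/\varepsilon$
and $P \times \nu(A \times B) < \varepsilon$.

Let now $C \in \mathcal{A} \otimes \mathcal{A}_1$ be such that $P_0 \times \nu(C) < \delta/2$. Then, since $C$ can
be approximated by rectangles, there are some $A \in \mathcal{A}$ and $B \in \mathcal{A}_1$ such that
$A \times B \supseteq C$ and $P_0 \times \nu(A \times B) < \delta$. This implies that $P \times \nu(C) \le P \times \nu(A \times B) < \varepsilon$. □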

A comparison of Blackwell sufficiency and sufficiency is given in the following
theorem ([22, Theorem 22.11]).

Theorem 7.10. Let $(\Omega, \mathcal{A}, \mathcal{P})$ be an experiment, $(\Omega_1, \mathcal{A}_1)$ a measurable space and
$\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ a Markov kernel.

(i) If $(\Omega, \mathcal{A}, \mathcal{P})$ is dominated by a $\sigma$-finite measure $\mu$ on $(\Omega, \mathcal{A})$ and $\nu$ is
Blackwell sufficient, then $\nu$ is sufficient.

(ii) If $(\Omega_1, \mathcal{A}_1, \nu(\mathcal{P}))$ is dominated by a $\sigma$-finite measure $\mu_1$, $(\Omega, \mathcal{A})$ is a
standard Borel space and $\nu$ is sufficient for $\mathcal{P}$, then $\nu$ is also Blackwell
sufficient.

Proof. (i) can be proved from Theorems 7.1, 7.5 and 7.9. (ii) follows from Remark
7.4 and Theorem 6.3. □

We can list the results of the present section as follows. 

Corollary 7.11. Let $\mathcal{P} \subset M_1^+(\Omega, \mathcal{A})$ be a dominated family and let $P_0 \in C(\mathcal{P})$ be
such that $\mathcal{P} \sim P_0$. Let $\nu : \Omega \times \mathcal{A}_1 \to [0, 1]$ be a Markov kernel. Then the following
are equivalent.

(i) $\nu$ is pairwise sufficient for $\mathcal{P}$.

(ii) $\nu$ is sufficient for $\{P, P_0\}$ for each $P \in \mathcal{P}$.

(iii) For all $P \in \mathcal{P}$ and all convex functions $f : [0, \infty) \to \mathbb{R}$, we have
$$D_f(P, P_0) = D_f(\nu(P), \nu(P_0)).$$

(iv) There is a strictly convex function $f : [0, \infty) \to \mathbb{R}$ such that for all $P \in \mathcal{P}$
$$D_f(P, P_0) = D_f(\nu(P), \nu(P_0)) < \infty.$$

(v) $\nu$ is sufficient for $\mathcal{P}$.

(vi) There is a weak Markov kernel $\nu' : \Omega_1 \times \mathcal{A} \to [0, 1]$ with respect to $\{\nu(P) :
P \in \mathcal{P}\}$ such that $\nu'(\nu(P)) = P$ for all $P \in \mathcal{P}$. If $(\Omega, \mathcal{A})$ is a standard
Borel space, then $\nu'$ is a Markov kernel and $\nu$ is Blackwell sufficient for $\mathcal{P}$.

Proof. The equivalence of (i)-(v) follows directly from our previous results. (v) $\Rightarrow$
(vi) follows from Remark 7.4. The implication (vi) $\Rightarrow$ (v) follows from
Theorems O and EH □ 

7.5. Application to fuzzy quantum observables. Recall that if $m_0$ is a faithful
state on $L$ then, for every state $m$ and every observable $\xi$ on $L$, it holds that $m \circ \xi \ll
m_0 \circ \xi$. Therefore $\mathcal{P} = \{m \circ \xi : m \in S\}$ is a dominated family, with $\mathcal{P} \sim m_0 \circ \xi$.
Applying Corollary 7.11 and Theorem 7.10 we obtain the following theorem.

Theorem 7.12. Let $L$ be a $\sigma$-orthocomplete effect algebra with an order
determining system $S$ of $\sigma$-additive states, let $\xi$ and $\eta$ be real observables on $L$ such that
$\xi \preceq \eta$ with a confidence measure $\nu$, and let there exist a faithful state $m_0 \in S$. The
following conditions are equivalent.

(i) $\nu$ is pairwise sufficient for $\{m \circ \xi : m \in S\}$.

(ii) $\nu$ is sufficient for $\{m \circ \xi, m_0 \circ \xi\}$ for all $m \in S$.

(iii) For all $m \in S$ and all convex functions $f : [0, \infty) \to \mathbb{R}$, we have
$$D_f(m \circ \xi, m_0 \circ \xi) = D_f(\nu(m \circ \xi), \nu(m_0 \circ \xi)).$$

(iv) There is a strictly convex function $f : [0, \infty) \to \mathbb{R}$ such that for all $m \in S$
$$D_f(m \circ \xi, m_0 \circ \xi) = D_f(\nu(m \circ \xi), \nu(m_0 \circ \xi)) < \infty.$$

(v) $\nu$ is sufficient for $\{m \circ \xi : m \in S\}$.

(vi) $\nu$ is Blackwell sufficient for $\{m \circ \xi : m \in S\}$.

(vii) $\xi \sim \eta$.